Scraping all the texts of Luxun(鲁迅) from the Internet using Python (用Python爬取《鲁迅全集》)
I want to do some text mining practices on the texts of Luxun(鲁迅), a great Chinese writer. The first step is to get all the texts by Luxun, and I have no time typing all the texts word by word. So I decided to srape the texts from an online source. Source of the texts The texts of Luxun are scraped from 子夜星网. As it claimed, it contains all the texts in the Complete works of Luxun(鲁迅全集). I checked it, and so it did.
2019-10-12
2 min read
Writing academic papers using Sublime Text 3 + Pandoc
This post explains how to write academic papers using Sublime Text 3 in markdown format and transform the .md file into .docx and pdf files using pandoc. Prerequisites To write academic papers in markdown format requires a markdown editor, a format trnasformer, maybe also a literature manager. I use Sublime Text 3 as the markdown edit, Pandoc as the format transformer, and Zotero as the literature manager. This section lists the things we need. Sublime Text 3 and plugins Sublime Text 3 is a text editor that can be used for code, markup and prose. It is lightweight and is suitable for long texts. The followings are several plugins (or packages) we need:
2019-10-11
2 min read
中国的《伊利亚特》
希腊的盲诗人荷马写下了《伊利亚特》,以阿喀琉斯的愤怒为线索,讲述了特洛伊战争最后一年的战况;而这场战争是由美女海伦引起的。在同一时期的中国,著名的盲人史学家左丘明也记录了数场由美女夏姬引发的战役,而以申公巫臣的愤怒为线索。在此之前,还有详细的背景交代,充满戏剧色彩,称得上是《左传》中最惊心动魄的故事之一。
2013-08-04
4 min read