orangetcy / pre-modern_chinese_corpus_dataset
This project is forked from jiangyanting/pre-modern_chinese_corpus_dataset
A pre-modern Chinese corpus dataset. This is a pre-modern Chinese language corpus covering the period from the Song dynasty (10th century AD) to the Republic of China (early 20th century). All language resources are in txt format and are arranged by dynasty (Song, Yuan, Ming, Early Qing, Late Qing, and Republic of China). The relevant author information and the type of literature have also been labelled.
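Since the texts are described as txt files arranged by dynasty, a first sanity check after cloning might be to count the files per dynasty. The following is only a minimal sketch: it assumes each dynasty has its own top-level folder under the corpus root, which the description does not spell out, and the function name `count_texts_by_dynasty` is hypothetical.

```python
from collections import Counter
from pathlib import Path


def count_texts_by_dynasty(corpus_root):
    """Count .txt files under each top-level folder of corpus_root.

    Assumption: every top-level folder corresponds to one dynasty.
    The actual repository layout may differ.
    """
    root = Path(corpus_root)
    counts = Counter()
    for path in root.rglob("*.txt"):
        parts = path.relative_to(root).parts
        if len(parts) < 2:
            # A txt file directly in the root belongs to no dynasty folder.
            continue
        # The first path component below the root is used as the label.
        counts[parts[0]] += 1
    return dict(counts)
```

Calling `count_texts_by_dynasty("path/to/corpus")` (a placeholder path) returns a dictionary that maps each top-level folder name to its number of txt files. File contents are deliberately not read here, because the description does not state the text encoding.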