微软亚洲研究院中文分词语料_icwb2-data

时间:2021-04-29 12:36:17
【文件属性】:

文件名称:微软亚洲研究院中文分词语料_icwb2-data

文件大小:40.82MB

文件格式:RAR

更新时间:2021-04-29 12:36:17

中文分词语料

微软亚洲研究院中文分词语料库_自然语言处理_科研数据集


【文件预览】:
icwb2-data
----training()
--------msr_training.txt(12.25MB)
--------as_training.utf8(38.86MB)
--------cityu_training.utf8(8.15MB)
--------msr_training.utf8(16.11MB)
--------pku_training.txt(5.63MB)
--------cityu_training.txt(5.94MB)
--------as_training.b5(26.36MB)
--------pku_training.utf8(7.37MB)
----testing()
--------cityu_test.txt(133KB)
--------as_test.utf8(604KB)
--------pku_test.txt(335KB)
--------msr_test.utf8(547KB)
--------as_test.txt(412KB)
--------msr_test.txt(367KB)
--------pku_test.utf8(498KB)
--------cityu_test.utf8(197KB)
----README(2KB)
----doc()
--------result_instructions.txt(4KB)
--------instructions.txt(7KB)
----scripts()
--------mwseg.pl(3KB)
--------score(7KB)
----gold()
--------cityu_training_words.utf8(571KB)
--------cityu_test_gold.utf8(235KB)
--------msr_training_words.txt(723KB)
--------msr_test_gold.txt(569KB)
--------as_testing_gold.utf8(920KB)
--------as_training_words.utf8(1.33MB)
--------pku_training_words.utf8(479KB)
--------cityu_test_gold.txt(171KB)
--------pku_test_gold.utf8(701KB)
--------cityu_training_words.txt(412KB)
--------pku_test_gold.txt(539KB)
--------as_testing_gold.txt(624KB)
--------msr_training_words.utf8(1.02MB)
--------as_training_words.txt(951KB)
--------pku_training_words.txt(339KB)
--------msr_test_gold.utf8(749KB)

网友评论

  • 感谢分享,下载下来试试
  • 不错是我想要找的资源