Integrate Chinese Character Frequency Counter #110

baimafeima · 2018-08-23T15:39:14Z

It would be great to be able to paste arbitrary Chinese text into a field/box in Syng and get a Chinese character frequency count at the click of a button. This would make it possible to quickly identify the most important characters to learn from a particular Chinese text, which would help any college student prepare efficiently for exams.

https://czielinski.github.io/hanzifreq/hanzifreq/output/frequencies.html
See: https://github.com/czielinski/hanzifreq

These scripts allow the analysis of character frequencies in Chinese text corpora. This might be helpful for Chinese language learners to prioritize common characters when learning how to write.

sotch-pr35mac · 2018-08-23T18:44:43Z

That sounds like it could be a pretty helpful tool! So the feature would be to paste in some arbitrary block of Chinese text and get frequency data back from it about which characters are most frequently used?
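To make the request concrete, here is a minimal sketch of the counting step in Python. It is not taken from hanzifreq or Syng; the function name `char_frequencies` is hypothetical, and it assumes that "Chinese character" means a code point in the CJK Unified Ideographs block or Extension A, so punctuation, Latin letters, and digits are ignored.

```python
import re
from collections import Counter

# Assumption: a "Chinese character" is any code point in
# CJK Extension A (U+3400..U+4DBF) or the basic CJK block (U+4E00..U+9FFF).
HANZI = re.compile(r"[\u3400-\u4dbf\u4e00-\u9fff]")


def char_frequencies(text):
    """Return (character, count) pairs, most frequent first."""
    counts = Counter(HANZI.findall(text))
    return counts.most_common()


if __name__ == "__main__":
    sample = "学而时习之，不亦说乎？有朋自远方来，不亦乐乎？"
    for char, count in char_frequencies(sample)[:5]:
        print(char, count)
```

Rarer characters outside these two blocks would be skipped under this assumption; widening the regular expression would cover them.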

baimafeima · 2018-08-24T08:56:28Z

Yes, exactly. I think Syng would be a great choice for that, especially since hanzifreq is a terminal-based program without a suitable frontend.

sotch-pr35mac · 2018-08-26T23:47:13Z

I wouldn’t be able to include the actual hanzifreq script, but I would definitely be able to build a tool that does something similar. My question is: would we want just character frequency, or word frequency as well?

baimafeima · 2018-09-29T11:43:58Z

> My question is: would we want just character frequency or word frequency?

I think character frequency would be the feature I would most often use. How would you approach word frequency?

sotch-pr35mac · 2018-11-08T00:15:49Z

First the text would be tokenized into words, and then the frequency of each resulting word would be counted.
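The tokenize-then-count approach could look roughly like the sketch below. The thread does not say which tokenizer Syng would use, so this substitutes a simple greedy forward maximum matching segmenter; `TOY_VOCAB`, `tokenize`, and `word_frequencies` are hypothetical names, and the five-word vocabulary stands in for a real dictionary.

```python
from collections import Counter

# Hypothetical toy vocabulary; a real tool would load a full dictionary.
TOY_VOCAB = {"我们", "喜欢", "学习", "中文", "汉字"}


def tokenize(text, vocab, max_len=4):
    """Greedy forward maximum matching (an assumed, simplified tokenizer).

    At each position, take the longest vocabulary word of up to max_len
    characters; if none matches, emit the single character.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def word_frequencies(text, vocab):
    """Tokenize, drop tokens that are not pure CJK ideographs, then count."""
    tokens = [
        t for t in tokenize(text, vocab)
        if all("\u4e00" <= c <= "\u9fff" for c in t)
    ]
    return Counter(tokens).most_common()


if __name__ == "__main__":
    for word, count in word_frequencies("我们学习中文，我们喜欢汉字", TOY_VOCAB):
        print(word, count)
```

Greedy matching can split ambiguous text incorrectly, so word counts are only as reliable as the tokenizer and dictionary behind them, whereas character counts need no segmentation at all.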

baimafeima mentioned this issue Aug 23, 2018

Solus packaging problems czielinski/hanzifreq#4

Closed

sotch-pr35mac added feature-request good-first-issue labels Aug 23, 2018

sotch-pr35mac added this to the v1.5.0 milestone Aug 23, 2018

sotch-pr35mac added the onhold label Aug 23, 2018
