Launch Radmap v.6 (requires Flash Player 7)
project description
Radmap is an online visualizing dictionary for chinese language. Within the animation, 6743 most common Chinese characters are added to a bar chart, one by one, which has 186 bars, each representing a chinese radical. These radicals are used to categorize characters, for example in dictionaries. The sequence starts with the most common character and ends with the least used. One at a time, characters get attached to their radical(s) in the chart, making the bars grow. Clicking on a bar displays all the characters belonging to that radical, together with the radical's etymology and character's translations and pronounciations. A small search is included within every bar.
objectives
The project focuses on the development of an online Chinese language dictionary.
The aim of the research is to create more sophisticated front-end interfaces that enable visualizations of relations and connections between different words (Chinese characters), instead of simply following the 'search and find' approach. Our goal is to amplify and deepen the user's understanding of Chinese language.
In the late stage of development, Radmap is about to be a stand-alone, java application.
important notes
- Most of the data is freely available from the sources mentioned below. None of this information has been proofed, so might or might not be reliable.
- The 'frequency of usage' is measured as the number of results returned from the Google search engine. This means that characters typically found in spoken language are under-represented, whereas characters specific for written language and business and computers will be over-represented.
sources for the data
- list of characters — almost identical to the Chinese characters in GB 2312-80, which is the most common character set for Chinese web pages.
- radicals — adapted from the list by Ngai Kim Hoong.
- frequency of usage — obtained by passing every character to a perl script returning the number of search results. Uses the Google web api.
- translations — obtained from the Unihan database using the Unicode::Unihan perl module by Dan Kogai (and the data for those two Unihan fields is taken from the CEDICT).
- pinyin — obtained from the zidianf file by Adrian Robert.
- sounds files for the pinyin — adapted from .wav files from an unknown source.
This project is supported by the Jan van Eyck Academie, Maastricht, NL.