Wolff's Cebuano Dictionary Available on Bohol.ph

IJsselstein, Tuesday, 8 May 2012 04:45:42

After about four years of proof-reading at Distributed Proofreaders, the digital edition of John U. Wolff's Dictionary of Cebuano Visayan is now nearing completion. This dictionary of over 1200 pages was first published in 1972, and is one of the most comprehensive Cebuano dictionaries available. For two years, an experimental interface to the raw dictionary data has been available on this site, but today we are ready to show a new interface to the fully checked and tagged text version of this dictionary.

John Wolff spend about 10 years producing this dictionary from scratch. With a team of local assistants, he collected words from actual spoken conversations and print publications in the old-fashioned way: using cards to note down each word and its usage. This, way, the dictionary reflects the language as it was in the sixties. Some of it strengths for foreign learners are that it includes sample sentences with most entries, uses accents to help with the correct pronunciation, and identifies plants and animals with their scientific names.

For Cebuano speakers, some aspects of this dictionary may make it a little bit harder to use. First of all, the orthography will take some time to get used to, as Wolff resolutely purged the e and o from the alphabet, using i and u, respectively, in their place. Second, many of the translations, especially those of the sample sentences, the author uses American idiom in an attempt not only to translate the literal meaning, but also the connotation of the usage. Reading those will actually also help you improve your English!

Finally, this dictionary also indicates, through a system of codes, the various possible uses of verbs. This system, however, requires a careful reading of the introduction of the dictionary.

This project wouldn't have been possible without the help of countless volunteers proofreading the data, and, even more important the huge effort John Wolff put into compiling this work, and the generosity of his publisher to place this work in the public domain.

All raw data and scripts used to produce this dictionary are available from Google Code.

You are now all invited to play around with the interface here, use it as you think it is useful, and tell us about your experiences in the comments below this article. All your ideas and criticism are welcome.

If you have an account with Bohol.ph, you can also log in and leave notes with each entry, which you can make public if you like (note that the site administrators can read all such notes, even if not made public)

Some Numbers...

Number of entries: 21761
Number of Cebuano words: 264485 (69604 distinct)
Number of English words: 623159 (25944 distinct)
Total number of words: 961801
Size of master file: 7.64 MB

Known Issues

Highlighting of the search term does only work when the word found is an exact match with the search term. (Need to implement regular expression matching in high-lighting code.)

Double accented letters appear with the top accent after the letter. (Limitation of font used in combination with use of Unicode combined diacritics.)

Jeroen Hellingman