SMC Monthly Report: September 2024
EventsSMC WebsiteSMC new website launched in Bengaluru. Riya Sabu volunteered to initiate the new website,
Santhosh Thottingal will be presenting his work “Parametric type design in the era of variable and color fonts" at G21C (Grapholinguistics in the 21st Century, also called /gʁafematik/), Venice, Italy durimg October 2024. Read more on the design process here.
He will be presenting his design experiments using METAPOST for Nupuram and Malini typefaces that are variable fonts.
Malayalam Pronunciation Dictionary aka Malayalam Phonetic Lexicon curated by Kavya Manohar as part of her PhD, is now available in Huggingface hub as a dataset. It gives Phonemic transcription of Malayalam words in IPA format.
This is a collection of Malayalam words and their pronunciation described in IPA format. The pronunciations has been automatically generated using [Mlphon] (https://pypi.org/project/mlphon/) Python library. The Malayalam words in this dataset are categorized into: Common words (ordered by frequency of occurrence in Indic-NLP-Corpus), Verbs, Nouns, English words
Nouns of Sanskrit origin, Proper nouns, Pronouns, Person names and Place names. The commonwords are ordered by frequency of occurrence in Indic-NLP-Corpus. All other categories of words were derived from curated collection of words in Mlmorph.
Santhosh Thottingal published a new dataset, 'A Day in History' on Huggingface. This is a dataset prepared out of wikipedia pages like https://en.wikipedia.org/wiki/Category:Days_of_the_year.
You can try out a demo here. This is available in Malayalam and English.
Source code: https://github.com/santhoshtr/day-in-history/