Foreign word detection in mlmorph

The test corpus for Malayalam morphological analysis contains many foreign words. They are written either in a non-Malayalam script or in Malayalam script. For example, “ഇലക്ട്രിസിറ്റി”, “ഡോക്
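
The first case, a word written in a non-Malayalam script, can be checked from code points alone. The sketch below is only an illustration and is not taken from mlmorph: the function name `is_malayalam_script` is hypothetical, and it assumes that "Malayalam script" means the Unicode Malayalam block (U+0D00 to U+0D7F) plus the zero-width joiners that Malayalam text commonly contains.

```python
# Hypothetical helper, not part of mlmorph: a script check based on code points.
MALAYALAM_START, MALAYALAM_END = 0x0D00, 0x0D7F  # Unicode Malayalam block
JOINERS = {"\u200c", "\u200d"}  # ZWNJ and ZWJ, ignored for the check


def is_malayalam_script(word):
    """Return True if every character (ignoring joiners) is in the Malayalam block."""
    letters = [ch for ch in word if ch not in JOINERS]
    return bool(letters) and all(
        MALAYALAM_START <= ord(ch) <= MALAYALAM_END for ch in letters
    )


print(is_malayalam_script("ഇലക്ട്രിസിറ്റി"))
print(is_malayalam_script("electricity"))
```

A check like this only separates scripts. A foreign word written in Malayalam script, such as “ഇലക്ട്രിസിറ്റി”, passes it, so the second case needs a different approach.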

Markov chain for Malayalam

This was originally written by Santhosh Thottingal and published elsewhere. I have been trying to generate a Markov chain for Malayalam content. A Markov chain is a stochastic model describing a
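
To make the idea of a Markov chain over text concrete, here is a minimal first-order, character-level sketch. It is not the author's implementation: the function names `build_chain` and `generate` are hypothetical, and it treats each Unicode code point as a state, which is an assumption made for brevity. For Malayalam, a syllable or grapheme-cluster level state would probably be a better unit, since conjuncts span several code points.

```python
import random
from collections import Counter, defaultdict


def build_chain(text):
    """Count, for each character, how often each other character follows it."""
    chain = defaultdict(Counter)
    for current, following in zip(text, text[1:]):
        chain[current][following] += 1
    return chain


def generate(chain, start, length, seed=None):
    """Walk the chain from `start`, sampling successors in proportion to their counts."""
    rng = random.Random(seed)
    out = [start]
    while len(out) < length:
        successors = chain.get(out[-1])
        if not successors:
            break  # dead end: this state was never followed by anything
        symbols = list(successors)
        weights = list(successors.values())
        out.append(rng.choices(symbols, weights=weights)[0])
    return "".join(out)


chain = build_chain("abab")
print(dict(chain["a"]))          # {'b': 2}
print(generate(chain, "a", 4))   # abab
```

The next state depends only on the current one, which is the defining property of a Markov chain. Training `build_chain` on a large Malayalam corpus instead of the toy string would give the transition counts for real text.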