Note: Currently new registrations are closed, if you want an account Contact us
Difference between revisions of "SMC/SoC/2008"
(No difference)
|
Revision as of 19:21, 12 March 2008
SMC in Google Summer of Code 2007
Santhosh Thottingal will be lead admin this year, and will deal with the administrative stuff with Google for SMC to be a mentoring organisation
Ideas for Google Summer of Code 2008
Tokenizer/Lemmatiser for malayalam for GATE
Write a Lemmatiser for Malayalam. See whether we can do a plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. Google search GATE,download and install GATE , and in the plugins directory a hindi tokenizer and lemmatiser is available.
Functional Optical character Recognition system
Add malayalam Support for tesseract OCR.
- Study tesseract OCR system
- Recogntion of all characters
- Layout recogization using ocropus (optional ?)
http://code.google.com/p/tesseract-ocr/ http://code.google.com/p/ocropus/
Write a Gnome Speech Driver for Dhvani and Integrate it with Orca
- Orca for visually impaired users uses gnome speech for speech engines. Currently Festival, Espeak, freetts etc have drivers for gnome speech. We need to write a driver for dhvani.
- Develop plugins for KTTS/Gedit/Firefox
Rewrite the Dhvani sound system with SDL
- Rewrite the ALSA sound system of Dhvani with SDL to make it a cross platform application
- Packaging for different platforms
- Bug fixes for langauge modules and Code clean up
- Adding pitch/volume/pause support for the generated speech
Localization of Free Content Management Systems to Malayalam-Drupal &Joomla
100% localization of Drupal and Joomla CMS systems to Malayalam
Speech recognition system for Malayalam
- Develop a speech recognition system for Malayalam using the concepts of memory prediction framework
How to Apply
Selection procedure
http://code.google.com/soc/2008/faqs.html