Note: Currently new registrations are closed, if you want an account Contact us
Difference between revisions of "SMC/SoC/2008"
(8 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
[[SMC/SoC/2007|SMC in Google Summer of Code 2007]] | |||
[[User:Santhosh|Santhosh Thottingal]] will be lead admin this year, and will deal with the administrative stuff with Google for SMC to be a mentoring organisation | |||
==Ideas for Google Summer of Code 2008== | ==Ideas for Google Summer of Code 2008== | ||
===Tokenizer/Lemmatiser for malayalam for GATE=== | ===Tokenizer/Lemmatiser for malayalam for GATE=== | ||
Write a Lemmatiser for Malayalam. See whether we can do a plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. | Write a Lemmatiser for Malayalam. See whether we can do a plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. Google search GATE,download and install GATE , and in the plugins directory a hindi tokenizer and lemmatiser is available. | ||
=== Functional Optical character Recognition system=== | === Functional Optical character Recognition system=== | ||
Add malayalam Support for tesseract OCR. | Add malayalam Support for tesseract OCR. | ||
Line 18: | Line 22: | ||
===Rewrite the Dhvani sound system with SDL=== | ===Rewrite the Dhvani sound system with SDL=== | ||
#Rewrite the ALSA sound system of | #Rewrite the ALSA sound system of [[Dhvani|Dhvani]] with [http://www.libsdl.org/ SDL] to make it a cross platform application | ||
#Packaging for different platforms | #Packaging for different platforms | ||
#Bug fixes for langauge modules and Code clean up | #Bug fixes for langauge modules and Code clean up | ||
#Adding pitch/volume/pause support for the generated speech | #Adding pitch/volume/pause support for the generated speech | ||
===Localization of Free Content Management Systems to Malayalam-Drupal | ===Localization of Free Content Management Systems to Malayalam-Drupal &Joomla === | ||
100% localization of Drupal and Joomla CMS systems to Malayalam | 100% localization of Drupal and Joomla CMS systems to Malayalam | ||
===Speech recognition system for Malayalam=== | |||
#Develop a speech recognition system for Malayalam using the concepts of memory prediction framework | |||
==How to Apply == | ==How to Apply == | ||
see http://code.google.com/soc/2008/faqs.html | #see http://code.google.com/soc/2008/faqs.html | ||
#[http://wiki.debian.org/SummerOfCode2008/StudentApplicationTemplate Student Application Template] | |||
==Selection procedure == | ==Selection procedure == | ||
http://code.google.com/soc/2008/faqs.html | |||
==Guidelines for Students == | ==Guidelines for Students == | ||
==Guidelines for Mentors == | ==Guidelines for Mentors == | ||
[http://www.gnome.org/~federico/docs/summer-of-code-mentoring-howto/index.html Summer of Code Mentoring HOWTO] |
Revision as of 19:21, 12 March 2008
SMC in Google Summer of Code 2007
Santhosh Thottingal will be lead admin this year, and will deal with the administrative stuff with Google for SMC to be a mentoring organisation
Ideas for Google Summer of Code 2008
Tokenizer/Lemmatiser for malayalam for GATE
Write a Lemmatiser for Malayalam. See whether we can do a plugin for GATE for malayalam, that would help NLP reasearchers a lot and that would be a great idea. Google search GATE,download and install GATE , and in the plugins directory a hindi tokenizer and lemmatiser is available.
Functional Optical character Recognition system
Add malayalam Support for tesseract OCR.
- Study tesseract OCR system
- Recogntion of all characters
- Layout recogization using ocropus (optional ?)
http://code.google.com/p/tesseract-ocr/ http://code.google.com/p/ocropus/
Write a Gnome Speech Driver for Dhvani and Integrate it with Orca
- Orca for visually impaired users uses gnome speech for speech engines. Currently Festival, Espeak, freetts etc have drivers for gnome speech. We need to write a driver for dhvani.
- Develop plugins for KTTS/Gedit/Firefox
Rewrite the Dhvani sound system with SDL
- Rewrite the ALSA sound system of Dhvani with SDL to make it a cross platform application
- Packaging for different platforms
- Bug fixes for langauge modules and Code clean up
- Adding pitch/volume/pause support for the generated speech
Localization of Free Content Management Systems to Malayalam-Drupal &Joomla
100% localization of Drupal and Joomla CMS systems to Malayalam
Speech recognition system for Malayalam
- Develop a speech recognition system for Malayalam using the concepts of memory prediction framework
How to Apply
Selection procedure
http://code.google.com/soc/2008/faqs.html