SMC/SoC/2008: Difference between revisions
Swathantra Malayalam Corpus |
No edit summary |
||
| Line 15: | Line 15: | ||
* In the first phase need to build a specification document, clearly written manual for building the corpus and should build the tools needed to build the corpus and use the corpus. | * In the first phase need to build a specification document, clearly written manual for building the corpus and should build the tools needed to build the corpus and use the corpus. | ||
* Anybody who like to contribute to the project must be able to do so and the specifications should be of the best covering all the aspects on classification of data, annotation of data, structure of storage and all related details. | * Anybody who like to contribute to the project must be able to do so and the specifications should be of the best covering all the aspects on classification of data, annotation of data, structure of storage and all related details. | ||
* As a part of the project, when we finish the summer, we must be able to build a complete specification document and programs to build the corpus and access the corpus(building the whole process must be a collaborative effort, it is not coming under this phase). | * As a part of the project, when we finish the summer, we must be able to build a complete specification document(document, explanations,related presentations, demo files etc.) and programs to build the corpus and access the corpus(building the whole process must be a collaborative effort, it is not coming under this phase). | ||
* More importantly, the structure should be an extensible one for all indic languages. | |||
Please add more details that can be added to a corpora project. | Please add more details that can be added to a corpora project. | ||