CLARIN (Common Language Resources and Technology Infrastucture) is a research infrastructure of language resources and its technology.
It offers some services to the different communities of linguists, the humanities scholars and to the society.
The mission of CLARIN to create a Pan European infrastructure that will boost humanities research in a multicultural and multilingual area as Europe is and facilitate a multilingual and multicultural education in schools, colleges and universities.
By language resources and its technology they mean all knowledge sources based on written or spoken language and the tools to carry out operations on such language material. There are some characteristics about what CLARIN language resources and technologies are:
- Texts of all sorts which can be digitized medieval sources, web-sites, newspapers, digitized books etc.
- Multimedia recordings (audio/video) and time series recorded during communication like data glove or eye tracking.
- Various types of manually or automatically created annotations on texts, media streams etc.
- Tools such as aligners, speech recognizers, tokenizers, part-of-speech taggers, parsers, manual annotators, viewers etc.
- Various types of knowledge sources encapsulating knowledge about resources and languages such as metadata descriptions, GIS, lexica, concept registries, ontologies etc.
The Language Resource and Technology community is working on the content of the resources and apply specific tools tailored to their research interests, the language resource community is interested in linguistic content, structures, formal semantics etc.
Common Language Resources and Technology Infraestructure (CLARIN). (2008). Retrieved March 31, 2008 from http://www.clarin.eu/