Language Capabilities
The Teneo Platform currently offers Natural Language Processing (NLP) Capabilities for, and supports the development of conversational AI solutions in, 86 languages. This means that the Platform works with these languages, recognizes their alphabets and characters, knows what constitutes a word and a sentence in each of them, and can normalize and pre-process text (e.g., tolerate misspellings and abbreviations).
More specifically, it also means that the Teneo Platform supports creating Flows, Language Objects, Entities, and Classes; writing syntax in the Teneo Linguistic Modeling Language; creating Integrations and other global elements; and using revisitability, flow stacks, dialogue history, and so on.
In the Teneo Platform, the NLP Capabilities are provided by Input Processors (IPs), which pre-process inputs so that the Teneo Engine can perform different processes on them, such as normalization, tokenization, and spelling correction, or even number recognition and language detection.
Each language supported by the Teneo Platform has a chain of Input Processors that know how to process that particular language. Most of the available languages share the Standard Input Processors Chain, each with its own configuration scheme, while separate Input Processors Chains are available for Chinese, Japanese, Korean, and Turkish.
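
The idea of a per-language chain with a shared default can be illustrated with a small sketch. This is not the Teneo Engine's implementation or API: the function names, the language codes, the abbreviation table, and the character-level tokenizer for Chinese are all hypothetical and chosen only to show how inputs might flow through a chain of processors.

```python
import re
import unicodedata
from typing import Callable, Dict, List

# Hypothetical type: each processor takes a list of tokens and returns a new list.
Processor = Callable[[List[str]], List[str]]


def normalize(tokens: List[str]) -> List[str]:
    """Apply Unicode NFKC normalization, lowercase, and collapse whitespace."""
    return [" ".join(unicodedata.normalize("NFKC", t).lower().split()) for t in tokens]


def tokenize_words(tokens: List[str]) -> List[str]:
    """Split into word tokens and single punctuation marks."""
    out: List[str] = []
    for t in tokens:
        out.extend(re.findall(r"\w+|[^\w\s]", t))
    return out


def expand_abbreviations(tokens: List[str]) -> List[str]:
    """Replace a few known abbreviations (toy table, assumption only)."""
    table = {"pls": "please", "thx": "thanks"}
    return [table.get(t, t) for t in tokens]


def tokenize_characters(tokens: List[str]) -> List[str]:
    """Toy stand-in for real word segmentation: one token per character."""
    return [ch for t in tokens for ch in t if not ch.isspace()]


# Default chain shared by most languages in this sketch.
STANDARD_CHAIN: List[Processor] = [normalize, tokenize_words, expand_abbreviations]

# Languages with their own chain (only one hypothetical entry shown here).
CHAINS: Dict[str, List[Processor]] = {
    "zh": [normalize, tokenize_characters],
}


def process(text: str, language: str) -> List[str]:
    """Run the input through the chain registered for the language."""
    tokens = [text]
    for processor in CHAINS.get(language, STANDARD_CHAIN):
        tokens = processor(tokens)
    return tokens
```

With these definitions, `process("Pls  HELP me!", "en")` falls back to the standard chain and returns `["please", "help", "me", "!"]`, while `process("你好 世界", "zh")` uses the dedicated chain and returns one token per character. Keeping the chains in a lookup table with a default mirrors the description above: most languages reuse one chain, and only a few need their own.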
On top of the NLP Capabilities, further Natural Language (NL) Analyzers are available for a number of the languages. The NL Analyzers group together additional resources available in the Teneo Platform: these resources provide additional information about evaluated inputs, but they are not required for a language to function in the Teneo Platform. Among others, the Teneo NL Analyzers include support for understanding and interpreting date and time expressions, Part-of-Speech (POS) and Morphological annotation, and recognition of Named Entities.
Last but not least, the Teneo Platform also offers Pre-built Knowledge, in the form of the Teneo NLU Ontology and Semantic Networks together with Conversational Modules, in a number of the supported languages. Pre-built Knowledge is considered relevant for any project domain and any type of solution: it enables developers to get a new solution up and running more quickly because the project team can focus only on project- or domain-specific needs. Read more about Pre-built Knowledge here.
The table below provides an overview of the supported languages in the Teneo Platform.
Language | NLP Capabilities | NL Analyzers | Pre-built Knowledge |
---|---|---|---|
Afrikaans | ✔ | ||
Albanian | ✔ | ||
Amharic | ✔ | ||
Armenian | ✔ | ||
Azerbaijani | ✔ | ||
Basque | ✔ | ||
Belarusian | ✔ | ||
Bengali (Bangla) | ✔ | ||
Bosnian | ✔ | ||
Bulgarian | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Catalan | ✔ | ||
Chinese (Mandarin) | ✔ | POS and Morphology ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Croatian | ✔ | ||
Czech | ✔ | | Ontology and Semantic Network ✔ |
Danish | ✔ | POS and Morphology ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Dutch | ✔ | POS and Morphology ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
English | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ Offensive ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Esperanto | ✔ | ||
Estonian | ✔ | ||
Ewe | ✔ | ||
Faroese | ✔ | ||
Finnish | ✔ | ||
French | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Frisian | ✔ | ||
Galician | ✔ | ||
Georgian | ✔ | ||
German | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Greek | ✔ | ||
Gujarati | ✔ | ||
Hindi | ✔ | ||
Hungarian | ✔ | ||
Icelandic | ✔ | ||
Igbo | ✔ | ||
Indonesian | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Irish | ✔ | ||
Italian | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Japanese | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Kazakh | ✔ | ||
Kinyarwanda | ✔ | ||
Kirundi (Rundi) | ✔ | ||
Korean | ✔ | POS and Morphology ✔ | |
Kyrgyz | ✔ | ||
Latvian | ✔ | ||
Lithuanian | ✔ | ||
Luxembourgish | ✔ | ||
Macedonian | ✔ | ||
Malagasy | ✔ | ||
Malay | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Maltese | ✔ | ||
Marathi | ✔ | ||
Mongolian | ✔ | ||
Nepali | ✔ | ||
Norwegian (Nynorsk/Bokmål) | ✔ | Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Odia | ✔ | ||
Oromo | ✔ | ||
Papiamento | ✔ | ||
Polish | ✔ | ||
Portuguese (European/Brazilian) | ✔ | Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Quechuan (Quechua) | ✔ | ||
Romanian | ✔ | ||
Romansh | ✔ | ||
Russian | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Sámi | ✔ | ||
Sango | ✔ | ||
Scottish Gaelic | ✔ | ||
Serbian | ✔ | ||
Shona | ✔ | ||
Sinhala | ✔ | ||
Slovak | ✔ | ||
Slovene | ✔ | ||
Somali | ✔ | ||
Spanish | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Swahili (Kiswahili) | ✔ | ||
Swazi | ✔ | ||
Swedish | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Tigrinya | ✔ | ||
Tsonga | ✔ | ||
Tswana (Setswana) | ✔ | ||
Turkish | ✔ | POS and Morphology ✔ NERs ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Turkmen | ✔ | ||
Ukrainian | ✔ | ||
Uzbek | ✔ | ||
Vietnamese | ✔ | ||
Welsh | ✔ | ||
Yoruba | ✔ | ||
Zulu (isiZulu) | ✔ | ||
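
When planning a multilingual solution, it can help to query the table programmatically. The sketch below transcribes the NL Analyzers column of the table above into a plain dictionary and looks up which languages offer a given analyzer. The dictionary and the helper name `languages_with` are illustrative assumptions, not part of any Teneo API, and the data is only as current as the table itself.

```python
from typing import Dict, List, Set

# NL Analyzers per language, transcribed from the table above.
# Languages not listed here have NLP Capabilities but no NL Analyzers.
NL_ANALYZERS: Dict[str, Set[str]] = {
    "Chinese (Mandarin)": {"POS and Morphology", "Date&Time", "Sentiment"},
    "Danish": {"POS and Morphology", "Date&Time"},
    "Dutch": {"POS and Morphology", "Date&Time", "Sentiment"},
    "English": {"POS and Morphology", "NERs", "Date&Time", "Sentiment", "Offensive"},
    "French": {"POS and Morphology", "NERs", "Date&Time"},
    "German": {"POS and Morphology", "NERs", "Date&Time", "Sentiment"},
    "Italian": {"POS and Morphology", "NERs", "Date&Time", "Sentiment"},
    "Japanese": {"POS and Morphology", "NERs", "Date&Time"},
    "Korean": {"POS and Morphology"},
    "Norwegian (Nynorsk/Bokmål)": {"Date&Time"},
    "Portuguese (European/Brazilian)": {"Date&Time"},
    "Spanish": {"POS and Morphology", "NERs", "Date&Time"},
    "Swedish": {"POS and Morphology", "NERs", "Date&Time", "Sentiment"},
    "Turkish": {"POS and Morphology", "NERs"},
}


def languages_with(analyzer: str) -> List[str]:
    """Return the languages offering the given analyzer, sorted by name."""
    return sorted(lang for lang, found in NL_ANALYZERS.items() if analyzer in found)
```

For example, `languages_with("Sentiment")` returns Chinese (Mandarin), Dutch, English, German, Italian, and Swedish, and `languages_with("Offensive")` returns only English.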