ActivaTM: Matrix, Scalable, Infinite Language Database

With ActivaTM, you can create a server version of all your translation memories and bilingual language assets, no matter what Computer Assisted Translation (CAT) tool you use.

Translators can translate and access each other’s work simultaneously, regardless of which CAT tool each of them is using.

ActivaTM is the most advanced multi-channel, multi-format and multi-language translation memory system. It is based on Elasticsearch technology.

Because of its open-source Elasticsearch origins, it has been designed to supersede the expensive model of server-based CAT tool providers. ActivaTM is indeed a server translation memory, but with no strings attached. As reported by Jost Zetzsche in his Tool Box Journal, ActivaTM brings the first real innovation to the CAT tool translation memory environment in decades. In short, ActivaTM takes away the need for every Computer Assisted Translation tool to have its own translation memory.

The European Commission has selected ActivaTM as a CEF (Connecting Europe Facility) project. It is the database of choice to provide unified translation memory services and to collect and build bilingual Big Data from public translation contracts.

You can install ActivaTM on your own server or use it in the cloud as SaaS, populate it with your own translation memories and public ones, and leverage full and subsegment matches. It does not matter which translation tool created the TMX. Yes, you can mix TMs from any program, including free TMs, and retrieve translations from all of them simultaneously.
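As a rough illustration of why the originating tool does not matter, the sketch below reads two TMX documents written by different tools and pools their translation units. It is not ActivaTM's importer: the function names `read_tmx` and `merge_tmx`, the sample data and the tool names are all hypothetical, and only the standard TMX elements (`header`, `tu`, `tuv`, `seg`) are assumed.

```python
import xml.etree.ElementTree as ET

# xml:lang belongs to the XML namespace; some older TMX files use a plain "lang" attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Two tiny, made-up TMX documents from two hypothetical tools.
TMX_A = """<tmx version="1.4">
  <header creationtool="ToolA" srclang="en"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Hello world</seg></tuv>
      <tuv xml:lang="es"><seg>Hola mundo</seg></tuv>
    </tu>
  </body>
</tmx>"""

TMX_B = """<tmx version="1.4">
  <header creationtool="ToolB" srclang="en"/>
  <body>
    <tu>
      <tuv xml:lang="EN"><seg>Save the file</seg></tuv>
      <tuv xml:lang="ES"><seg>Guarde el archivo</seg></tuv>
    </tu>
  </body>
</tmx>"""


def read_tmx(tmx_text):
    """Return (creation_tool, units) where each unit maps language code to segment text."""
    root = ET.fromstring(tmx_text)
    header = root.find("header")
    tool = header.get("creationtool", "unknown") if header is not None else "unknown"
    units = []
    for tu in root.iter("tu"):
        unit = {}
        for tuv in tu.findall("tuv"):
            lang = tuv.get(XML_LANG) or tuv.get("lang")
            seg = tuv.find("seg")
            if lang and seg is not None:
                # Lower-case the language code so "EN" and "en" land in the same bucket.
                unit[lang.lower()] = "".join(seg.itertext()).strip()
        if len(unit) >= 2:  # keep only units that are actually bilingual
            units.append(unit)
    return tool, units


def merge_tmx(documents):
    """Pool the units of several TMX documents, remembering which tool wrote each one."""
    pool = []
    for text in documents:
        tool, units = read_tmx(text)
        pool.extend({"tool": tool, "segments": unit} for unit in units)
    return pool


pool = merge_tmx([TMX_A, TMX_B])
for entry in pool:
    print(entry["tool"], entry["segments"]["en"], "->", entry["segments"]["es"])
```

Once the units sit in one pool keyed by language, a lookup no longer needs to know which program produced them.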

Tag handling systems have been developed using deep learning as part of our R&D program with the EU’s EXPERT Project. They solve many interoperability issues caused by the different ways tools handle tags. ActivaTM can be accessed and queried from multiple translation platforms simultaneously. See our online documentation. Translators can work in a variety of translation programs and all access the same translation memory (or a set of them) remotely and instantaneously.

Better still, ActivaTM will compare all incoming material so that it can create triangular TMs. If you have translated a text from English to Chinese and from English to Japanese, presto! You also have a Chinese-to-Japanese translation memory. ActivaTM was first known as “ElasticTM”. It provides unprecedented levels of flexibility and ownership over linguistic assets at a fraction of the cost.
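The triangular idea can be sketched in a few lines. This is a minimal sketch that assumes exact matching on the shared English segment, not a description of how ActivaTM aligns material internally. The function name `triangulate` and the sample segments are made up for the example.

```python
from collections import defaultdict


def triangulate(pivot_to_a, pivot_to_b):
    """Derive (a, b) pairs from two TMs that share the same pivot-language source.

    Both inputs are lists of (pivot_segment, target_segment) tuples.
    Only pivot segments that are identical in both TMs produce a new pair.
    """
    by_pivot = defaultdict(list)
    for pivot_seg, a_seg in pivot_to_a:
        by_pivot[pivot_seg].append(a_seg)

    derived = []
    for pivot_seg, b_seg in pivot_to_b:
        for a_seg in by_pivot.get(pivot_seg, []):
            derived.append((a_seg, b_seg))
    return derived


# English is the pivot: English-Chinese plus English-Japanese gives Chinese-Japanese.
en_zh = [("Good morning", "早上好"), ("Thank you", "谢谢")]
en_ja = [("Good morning", "おはようございます"), ("See you later", "またね")]

zh_ja = triangulate(en_zh, en_ja)
print(zh_ja)
```

Only "Good morning" appears in both memories, so it is the only segment that yields a Chinese-to-Japanese unit. A production system would presumably also tolerate small differences in the pivot text.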

We developed ActivaTM as part of our EXPERT Project, with a working grant from the European Union. It works as a fast, scalable, server-based Translation Memory Manager (TMM) with advanced search and fuzzy match capabilities that go beyond traditional TM systems.

Technical Specifications

ActivaTM is developed mainly in Python. It is based on Elasticsearch (for storage and full-text TM queries) and Apache Spark (for parallel computations). It has a full-featured REST API, so you can open it up immediately to all your freelance or in-house translators. Major features include:

  • Fast TM queries
  • Reverse language pairs (an English-to-Arabic translation generates Arabic-to-English entries, too)
  • Import/export of TMX files
  • Extensive TM filtering capabilities (for export and queries) by languages, domains, organisations, industry, creation date etc.
  • Built-in new language pair generation using pivot language (for example, French -> Spanish and French -> English will generate Spanish into and out of English)
  • Flexible, customisable fuzzy match (using various regex patterns and linguistic features)
  • XML tag handling (including deep learning model for accurate transferring of XML tags)
  • Moses output mode for easy integration into machine translation workflows
  • Compatible with SDL Trados Studio via dedicated plugin (including manual update of translation units)
  • Pretranslation of segments when only one or two items differ (words, dates, etc.)
  • Built-in TM content analysis & maintenance: clean “dirty” segments, create new segments by splitting existing ones based on linguistic rules and POS tagging
  • Advanced user scope management, limiting access to certain language pairs, domains and number of queries
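To make the regex-based fuzzy matching in the list above more concrete, here is a minimal sketch built on Python's standard `difflib`. It is an assumption-laden illustration, not ActivaTM's matcher: the names `normalise`, `fuzzy_score` and `best_match`, the number pattern and the 0.75 threshold are all hypothetical. Numbers and dates are replaced with a placeholder before scoring, so segments that differ only in those items score as identical.

```python
import re
from difflib import SequenceMatcher

# Hypothetical pattern: digit runs joined by ".", "," or "/" (covers 1,200.50 and 12/03/2019).
NUMBER = re.compile(r"\d+(?:[.,/]\d+)*")


def normalise(segment):
    """Lower-case the segment and replace every number or date with a placeholder."""
    return NUMBER.sub("<NUM>", segment.lower()).strip()


def fuzzy_score(query, candidate):
    """Similarity between 0.0 and 1.0 computed on the normalised segments."""
    return SequenceMatcher(None, normalise(query), normalise(candidate)).ratio()


def best_match(query, memory, threshold=0.75):
    """Return (score, source, target) for the best entry at or above threshold, else None."""
    scored = [(fuzzy_score(query, src), src, tgt) for src, tgt in memory]
    scored = [item for item in scored if item[0] >= threshold]
    return max(scored, default=None)


memory = [
    ("The invoice is due on 12/03/2019.", "La factura vence el 12/03/2019."),
    ("Please restart the server.", "Reinicie el servidor."),
]

hit = best_match("The invoice is due on 05/11/2020.", memory)
miss = best_match("Completely unrelated text", memory)
print(hit)
print(miss)
```

The query differs from the stored segment only in its date, so the two normalise to the same string. A pretranslation step could then copy the stored target and substitute the new date.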
