You are viewing a preview of this job. Log in or register to view more details about this job.

Data Scientist

About Us

Emerge Media, Inc. is a leading media and internet company focused in the core areas of search, communication, and translation. Our entire network of sites have received more than 50 million total monthly visits in more than 100 countries. Emerge Media has an 11-year history of success and is located in Chicago’s West Loop neighborhood.

Job Overview

We are looking for a Data Scientist to help secure our competitive advantage within the translation tech industry. You can consider yourself an ideal teammate if you are passionate about foreign language, international business, or using technology to make the world a smaller, more relatable place.   

This position is analytical and data driven. Applicants should be highly inquisitive, tech proficient, and interested in exploring the inner workings of Translate.com, a business-to-business tool that combines artificial intelligence and human intelligence to deliver content in any of over 96 language pairs. Our ideal candidate need not speak a foreign language, but should certainly share our enthusiasm for them! 

Candidate must meet the following technical qualifications:

·       A basic understanding of the different nuances with languages. Examples: Left to Right vs Right to Left, Character Based vs Word Based, Etc.

·       An understanding of how the following technologies work:

o   Natural Language Processing

o   Speech Recognition

o   Speech Synthesis

o   Rule Based Machine Translation

o   Statistical Based Machine Translation

o   Neural Network Machine Translation

o   Optical Character Recognition

·       Ability to calculate Translation Error Rate (TER) and Bilingual Evaluation Understudy (BLEU) scores in multiple languages

·       Ability to calculate Word Error Rate (WER) in multiple languages.

·       Experience evaluating 3rd party service providers.

·       Advanced MySQL or SQL experience.

·       Ability with a web based programming language to do language based API integrations for testing. (Integrate with the Google Translate API to do bulk Machine Translations)