Text Analytics Data Engineer (PySpark/AWS/MLOps)


A career at our company is an ongoing journey of discovery: our 58,000 people are shaping how the world lives, works and plays through next generation advancements in Healthcare, Life Science and Electronics. For more than 350 years and across the world we have passionately pursued our curiosity to find novel and vibrant ways of enhancing the lives of others.


Your Role

As a data engineer for Text Analytics/Natural Language Processing (NLP), you will work in our growing team within the Merck Data Office, supporting the Healthcare, Electronics and Life Science businesses. You will be responsible for developing, enhancing and maintaining data pipelines that combine enterprise data with NLP models developed by NLP experts. You will work in an agile framework, closely with other data engineers, NLP experts and architects, to ensure best practices are followed when designing, developing, testing, deploying and monitoring pipelines in a production setup. This position is part of the Group Data Strategy.


Your Profile

  • A higher-education degree (Master's) in computer science, information technology, information science, or a comparable technical field
  • 4+ years in engineering with experience in data integration (ELT, ETL) on big data and database platforms, analytics pipeline development, data modeling and data visualization of structured and unstructured data sets
  • Experience developing data pipelines and applications embedded with NLP/Machine Learning models with high proficiency in Python (PySpark)
  • Experience working with the big data ecosystem (i.e. distributed computing architectures) such as MapReduce, Hive, Spark, Oozie, Sqoop, Kafka and NoSQL databases
  • Experience in document indexing systems such as Elasticsearch, Solr or Lucene
  • Working knowledge of agile software development, version control (git), automated testing, continuous integration/deployment (CI/CD), DevOps, MLOps
  • Experience developing applications using AWS cloud services (e.g., S3, Athena, Lambda, Glue, EMR, Airflow) is highly desired
  • Experience with techniques used in Machine Learning and/or NLP in Python using libraries such as SKLearn, Gensim, SpaCy, NLTK, and so forth is a plus
  • Working knowledge of microservices, RESTful APIs, Docker, and Kubernetes is a plus
  • Excellent written and verbal communication skills, with ability to communicate effectively within the team
  • Progressive thinker and problem solver, with a strong ability to manage ambiguity/complexity


What we offer:  With us, there are always opportunities to break new ground. We empower you to fulfil your ambitions, and our diverse businesses offer various career moves to seek new horizons. We trust you with responsibility early on and support you to draw your own career map that is responsive to your aspirations and priorities in life. Join us and bring your curiosity to life!


Curious? Apply and find more information at https://jobs.vibrantm.com


We are committed to promoting a diverse and inclusive workforce. Applications from individuals are encouraged regardless of age, disability, sex, gender reassignment, sexual orientation, pregnancy and maternity, race, religion or belief and marriage and civil partnerships.


Job Requisition ID:  221520
Location:  Bangalore
Career Level:  D - Professional (4-9 years)
Working time model:  full-time

Careers during Covid-19
Thank you for visiting our careers website. We are always looking for curious minds to join our teams. We understand how much the world is being impacted by the Covid-19 crisis, and we want to assure you that your safety is very important to us. To ensure that everyone’s health is protected, you will likely be offered digital interview options instead of a standard face-to-face interview.

US employees must be fully vaccinated against COVID-19 prior to their start date unless an accommodation is granted by the Company. For the purposes of this requirement, which is a condition of employment, the Company uses the definition of “fully vaccinated” assigned by the Centers for Disease Control & Prevention.

North America Disclosure
The Company is committed to accessibility in its workplaces, including during the job application process. Applicants who may require accommodation during the application process should speak with our Candidate Services team at 844-655-6466 from 8:00am to 5:30pm ET Monday through Friday. If you are a resident of Connecticut or Colorado, you are eligible to receive additional information about compensation and benefits, which we will provide upon request. You may contact 855 444 5678 from 8:00am to 5:30pm ET Monday through Friday for assistance.

Notice on Fraudulent Job Offers
Unfortunately, we are aware of third parties that pretend to represent our company offering unauthorized employment opportunities. If you think a fraudulent source is offering you a job, please have a look at the following information.

Job Segment: Database, Analytics, Embedded, Computer Science, Cloud, Management, Technology
