Internship Data Engineer at Johnson & Johnson (stage)

6-3-2019 beerse
company profile

Caring for the world, one person at a time, inspires and unites the people of Johnson & Johnson. We embrace innovation - bringing ideas, products and services to life to advance the health and well-being of people around the world. We believe in collaboration which has led to breakthrough after breakthrough, from medical miracles that have changed lives, to the simple consumer products that make every day a little better. Our over 125,000 employees in 60 countries are united in a common mission: To help people everywhere live longer, healthier, happier lives.

The IT Insights Data Lake team is a DevOps team developing and supporting data and BI solutions for the J&J Technology Services organization and its IT business units. Day-to-day this team is focused on activities in the space of data engineering, advanced analytics and dashboarding using technologies like Cloudera Hadoop, Kafka, Neo4J graph database, Spark, Python and Tableau. The IT Insights data lake platform is serving over 500 data analysts with data and self-service analytics capabilities.

We currently have an internship position for a data engineer.

job description

The intern will work on technical implementation of data engineering solution focused around big data ingestion using technologies like Sqoop, Hive and Impala. She or he will be mainly responsible for developing a data ingestion framework in Python to bring several RDBMS data sources to our Cloudera Hadoop based data lake. During the course of the internship, we provide the intern the opportunity to get acquainted with other areas for a side project of her or his choice: streaming data analytics, machine learning, Tableau dashboarding, CI/CD and test automation,..

your contribution

Responsibilities and duties
- Main assignment (60%):
o End-to-end development in Python of data ingestion framework using Sqoop, Hive, Impala, Oozie and Yaml configuration files.
o Use the current data ingestion framework developed using Bash shell to form an understanding of different types of data inflow.
o Development of Python unit and regression tests to guarantee code quality.
o Writing of technical documentation on how to use and configure the framework.
o Implementation of error detection and alerting.
- Small side project (40%) in the area of streaming data analytics, machine learning, Tableau dashboarding, CI/CD and test automation,..

- Well-practiced programming and debugging skills, preferably in Python.
- Familiar with RDBMS technologies.
- Eagerness to learn new things.

Practical details
- Location: Beerse, Belgium
- Duration: 10 - 12 weeks
- Timeframe: Somewhere between Feb 4th - Dec 13th, 2019

The candidate must be a registered student for the entire course of the internship.
Unfortunately, graduates that are currently not enrolled in a study program are not eligible for this internship.
If you are interested in applying for this challenging internship, please send an e-mail to, including your resume and a short motivation.

functional analyst / business analyst (m/f)
employment type
full time
contract type
temporary contract
Chemische en farmaceutische industrie
your Randstad contact
014-60 71 13
014-60 71 13