About the job
We are a global biopharmaceutical company focused on human health. Our purpose is to find treatment to fight pain and ease suffering. We combine breakthrough science and advanced technology to develop life-changing medicines and vaccines.
Digital & Data is at the heart of Sanofi: our ambition is to be the leading digital healthcare platform to develop & deliver medicine faster, enable healthcare professionals to improve treatments and help patients improve their health. Our scale, strong connections within health ecosystems across the world, and ability to leverage Sanofi''s capabilities make us the best place to push the boundaries of medicine through technology.
We are building Digital R&D products developing outstanding capabilities that will participate in new drug discovery together with scientists, in trial efficiency and cycle time reduction, leveraging Data Analytics and Artificial Intelligence.
Ensure the quality of the data in coordination with Data Analysts and Data Scientists (peer validation)
Communicate results and findings in a structured way
Partner with Product Owner and Data Analysts to prioritize the pipeline implementation plan
Partner with Data Analysts and Data scientists to design pipelines relevant for business requirements
Leverage existing or create new "standard data pipelines" within Sanofi to bring value through business use cases
Remains up to date on company''s standards, industry practices and emerging technologies
You are a dynamic Data Engineer interested in challenging the status quo to ensure the seamless creation and operation of the data pipelines that are needed for the enterprise data and analytics initiatives following industry standard practices and tools for the betterment of our global patients and customers.
You are a valued influencer and leader who has contributed to making key datasets available to data scientists, analysts, and consumers throughout the enterprise to meet vital business use needs. You have a keen eye for improvement opportunities while continuing to fully comply with all data quality, security, and governance standards.
Experience working with data models, query tuning and data architecture design
Experience working within compliance (e.g.: quality, regulatory - data privacy, GxP, SOX) and cybersecurity requirements is a plus
Product-oriented, flexible, positive team player
Proficient in Data warehousing solutions (Snowflake a plus)
Proficient in using automation tools for CI/CD (Github Action, ArgoCD, CircleCI, )
Good knowledge of Integration Services (Informatica / IICS, Talend or similar data integration services a plus)
Good knowledge of orchestration frameworks (e.g. Apache Airflow, Prefect, Dagster)
Good knowledge of infrastructure as code (Terraform, CloudFormation a plus)
Familiarity with containerization (Docker) and container orchestration (Kubernetes / Openshift a plus)
Familiarity with Source Code Management Tools (GitHub a plus)
Familiarity with Visualization Tools and their interaction with source system (PowerBI, Tableau a plus)