Senior Data Engineer @ Entalpic
Shape Entalpic’s data future: build smart pipelines, manage multimodal datasets, and fuel AI for sustainable Chemistry & Materials. Drive impact in climate tech with a dynamic, mission-driven team!
We usually respond within two weeks
We are a dedicated team at the forefront of AI and chemistry, working to accelerate the energy transition. Our focus is on discovering new chemicals and materials that can enable more sustainable practices in sectors with urgent decarbonization needs.
Specifically, we are developing a modern generative AI platform to discover new catalysts that optimize chemical reactions, significantly reduce CO₂ emissions, and help transform carbon-intensive industries.
As an early-stage, AI-driven startup with over €5M in funding, our approach is grounded in state-of-the-art academic research, with a strong focus on simplicity, clarity, and constant optimization.
Join Entalpic to be part of a passionate, fast-growing team united by the belief that technology can drive meaningful impact toward a more sustainable future.
Co-founders: Mathieu Galtier, Victor Schmidt, Alexandre Duval
Entalpic is committed to equal opportunity employment and a diverse, inclusive workplace. We encourage applications from all backgrounds—even if you don’t meet every requirement. If you’re passionate about our mission and think you can contribute, we want to hear from you.
Reporting & Job Location
You will report to the CTO of Entalpic and be based in our Paris office.
Mission Highlights
As a key team member, you will contribute to two main areas:
- Data Infrastructure Development
- Design, build, and maintain scalable data infrastructure to integrate diverse data sources (text, simulations, experiments) in support of ML and LLM applications.
- Data Platform Enhancement
- Lead the development of internal tools to enable efficient, AI-enhanced access to data and promote a data-centric culture across the organization.
Role & Responsibilities
- Data Engineering: Build and optimize scalable data pipelines for simulation (e.g. DFT), textual (e.g. patents, papers), and experimental data (e.g. time series, imagery).
- Data Storage Solutions: Implement and manage secure, scalable data storage systems supporting analytics and ML workflows.
- Automation and Scripting: Create tools and scripts to automate data ingestion, transformation, and processing.
- Data Governance and Lineage: Establish policies for data quality, lineage tracking, and regulatory compliance.
- Infrastructure Support: Work closely with DevOps to integrate solutions with system architecture (AWS/GCP).
- Collaboration and Support: Partner with scientists and experts to meet data needs and enable data-driven decisions.
- Open Source Engagement: Contribute tools and learnings to open-source projects to support the broader community.
Profile
- Master’s or PhD in Computer Science, Data Engineering, or a related field
- 7+ years of experience in data engineering, with proven experience managing diverse data types and building scalable architectures
- Proficiency in at least two programming languages (e.g., Python, Rust, Scala, Go)
- Strong experience with both SQL (MySQL, PostgreSQL) and NoSQL (MongoDB)
- Deep understanding of data modeling, ETL, and data warehousing
- Cloud experience (AWS or GCP) and infrastructure-as-code tools (e.g., Terraform)
- Strong communication skills in English
- Ability to thrive in a fast-paced startup environment
Bonus Skills
- Experience with ML pipelines and AI infrastructure
- Contributions to open-source projects
- Familiarity with scientific data, especially in materials science
Expertise
- Programming: Strong in Python and at least one other language, with best practices in version control (Git)
- Data Management: Expertise in both SQL and NoSQL for large-scale data processing
- Cloud Platforms: Proficient with AWS or GCP and infrastructure-as-code (Terraform)
- DevOps Collaboration: Comfortable with CI/CD, containerization (Docker, Kubernetes)
- Open Source: Experience in contributing to and maintaining open-source libraries and communities
Compensation & Benefits
We are a no-nonsense startup focused on sustainable work culture and meaningful rewards. We offer:
- Competitive salary
- Equity package (BSPCE)
- Comprehensive health insurance (Alan Blue)
- Paid time off aligned with French standards
- A dynamic and supportive work environment with flexibility for remote work
- More to come as we grow!
- Department
- Join our startups
- Locations
- Paris
- Remote status
- Hybrid
- Employment type
- Full-time
Senior Data Engineer @ Entalpic
Shape Entalpic’s data future: build smart pipelines, manage multimodal datasets, and fuel AI for sustainable Chemistry & Materials. Drive impact in climate tech with a dynamic, mission-driven team!
Loading application form