
Innovation Point
Data Engineer
Braga
October 29, 2025
Full-time
About dstgroup
dstgroup is one of Portugal’s biggest construction companies, operating at the crossroads of construction, engineering, and digital transformation. Our workforce of over 3,000 employees generates massive daily data streams, and our mission is to build robust data foundations that empower advanced AI and NLP teams.
We are seeking a Data Engineer to design and maintain scalable, high-quality data pipelines that make data accessible, structured, and ready for model training and evaluation. Your work will directly enable our AI/NLP team to focus on innovation rather than data preparation.
What you will do
- Design, develop, and maintain ETL/CRISP-DM pipelines to process diverse data streams into structured, reliable datasets.
- Build and optimize SQL-based solutions, with a focus on the PostgreSQL ecosystem, including:
  - pgvector for embedding storage and retrieval,
  - PostGIS for geospatial analysis,
  - TimescaleDB for time-series data.
- Implement and manage REST APIs to expose data products to downstream consumers.
- Ensure data quality, governance, and reproducibility, with special focus on textual/NLP data collections.
- Develop and maintain containerized solutions with Docker, ensuring reproducibility and scalability.
- Use Git and GitLab CI/CD pipelines to automate testing, integration, and deployment of data workflows.
- Collaborate with AI/NLP teams to understand their data requirements and deliver datasets optimized for model training, evaluation, and deployment.
- Integrate open-source tools with Azure cloud services for storage, orchestration, and monitoring.
What we are looking for
- 2+ years of professional experience as a Data Engineer or in a similar role.
- Strong proficiency in SQL and relational databases (especially PostgreSQL).
- Hands-on experience with pgvector, PostGIS, or TimescaleDB.
- Experience designing REST APIs.
- Strong Python programming skills, preferably with PySpark.
- Proficiency in Docker for development and production environments.
- Experience with Git and GitLab CI/CD.
- Familiarity with Airflow and Azure cloud services.
- Previous experience collaborating with AI/ML teams, especially in preparing NLP datasets.
Nice to have (not mandatory)
- Ph.D. in Computer Science, Artificial Intelligence, or a related field.
- Experience in both academia and industry.
- A strong track record of scientific research (in any field), as well as prior work on information retrieval, knowledge representation and reasoning, structured knowledge extraction, or large-scale data analytics.
- Willingness to mentor junior team members and co-supervise master’s theses in collaboration with the Universities of Minho and Porto.
What we offer
- The chance to shape the data backbone for one of Portugal’s biggest construction companies.
- Work directly at the interface of data engineering, AI, and NLP, with immediate business impact.
- Hybrid work model with flexibility.
- Competitive salary and benefits package.
- A collaborative, forward-looking environment focused on innovation and data-driven decision-making.
Send your CV to:
info@innovpoint.com, or apply through the following form:
https://recrutamento.dstsgps.com/job-offer-details?jobOfferId=891
Contacts and address
Rua de Pitancinhos - Palmeira
4711-911 - Braga