Innovation Point

Innovation Point's mission is to identify and assess innovative ideas and opportunities, develop and prototype them, and finally make them available by introducing them to the markets.

Data Engineer

Remote

Location: Braga

Date: October 29, 2025

Type: Full-time

About dstgroup

dstgroup is one of Portugal's biggest construction companies, operating at the crossroads of construction, engineering, and digital transformation. Our workforce of over 3,000 employees generates massive daily data streams, and our mission is to build robust data foundations that empower advanced AI and NLP teams.

We are seeking a Data Engineer to design and maintain scalable, high-quality data pipelines that make data accessible, structured, and ready for model training and evaluation. Your work will directly enable our AI/NLP team to focus on innovation rather than data preparation.

What you will do

- Design, develop, and maintain ETL pipelines, following the CRISP-DM process, to turn diverse data streams into structured, reliable datasets.

- Build and optimize SQL-based solutions, with a focus on the PostgreSQL ecosystem, including:

  - pgvector for embedding storage and retrieval,

  - PostGIS for geospatial analysis,

  - TimescaleDB for time-series data.

- Implement and manage REST APIs to expose data products to downstream consumers.

- Ensure data quality, governance, and reproducibility, with special focus on textual/NLP data collections.

- Develop and maintain containerized solutions with Docker, ensuring reproducibility and scalability.

- Use Git and GitLab CI/CD pipelines to automate testing, integration, and deployment of data workflows.

- Collaborate with AI/NLP teams to understand their data requirements and deliver datasets optimized for model training, evaluation, and deployment.

- Integrate open-source tools with Azure cloud services for storage, orchestration, and monitoring.

What we are looking for

- 2+ years of professional experience as a Data Engineer or in a similar role.

- Strong proficiency in SQL and relational databases (especially PostgreSQL).

- Hands-on experience with pgvector, PostGIS, or TimescaleDB.

- Experience designing REST APIs.

- Strong Python programming skills, preferably with PySpark.

- Proficiency in Docker for development and production environments.

- Experience with Git and GitLab CI/CD.

- Familiarity with Airflow and Azure cloud services.

- Previous experience collaborating with AI/ML teams, especially in preparing NLP datasets.

Nice to Have (not mandatory):

- Ph.D. in Computer Science, Artificial Intelligence, or a related field.

- Experience in both academia and industry.

- A strong track record of scientific research (in any field), along with work on information retrieval, knowledge representation and reasoning, structured knowledge extraction, or large-scale data analytics.

- Willingness to mentor junior team members and co-supervise master’s theses in collaboration with the Universities of Minho and Porto.

What we offer

- The chance to shape the data backbone for one of Portugal’s biggest construction companies.

- Work directly at the interface of data engineering, AI, and NLP, with immediate business impact.

- Hybrid work model with flexibility.

- Competitive salary and benefits package.

- A collaborative, forward-looking environment focused on innovation and data-driven decision-making.

Send your CV to:

info@innovpoint.com or submit it through the following form:

https://recrutamento.dstsgps.com/job-offer-details?jobOfferId=891

Contacts and address

Rua de Pitancinhos - Palmeira 4711-911 - Braga