José Manuel Domínguez
Resume
About me
Skills
Data
Coding
Operations
Speaking
Mathematics
Management
Experience
Albert-Ludwigs-Universität Freiburg
Fraunhofer IWM
Endesa (Enel Group)
Instituto IBT
Education
BEng in Computer Science
MSc in Applied Mathematics (MathMods)
BSc in Physics
International Baccalaureate Diploma Programme
Certifications

I am a software developer with over 4 years of experience, passionate about building performant, user-centered software that delivers value. I focus on data engineering, backend development, and DevOps.

Skills

Data
🗄️

Python, Semantic web, Apache Airflow, SQL databases, MongoDB, Apache Spark, Pandas, LaTeX, Big data, Seaborn, Matplotlib, PostgreSQL, Hadoop, Apache Hive, SQLite

Coding
👨‍💻

Python, GitHub, Git, Docker, SQL databases, MongoDB, MATLAB, Backend development, Django, PostgreSQL, Fortran, Bootstrap, SQLite

Operations
⚙️

Python, GitHub, Docker, GitHub Actions, Ansible, Linux, Continuous delivery and continuous integration, Bash, Packer, Sentry, Jenkins, OpenStack, HTCondor, Grafana, High performance computing, Terraform, InfluxDB

Speaking
🇨🇽

English, Italian, Spanish, German

Mathematics
🧮

Operations research (combinatorial optimization), Gurobi

Management
💼

Agile Methodology

Experience

Site Reliability Engineer at Albert-Ludwigs-Universität Freiburg
Apr. 2023 - Now, Freiburg

Galaxy is an open-source data analysis platform focused on bioinformatics and powered by a high-performance computing network. It is driven by a global community of developers, researchers and infrastructure engineers. In the Freiburg Galaxy team, we are three site reliability engineers and our job is to ensure that usegalaxy.eu works reliably.

Due to the scale of our platform (more than 90,000 registered users and up to 800 simultaneous users) and the variety of user workloads, we have a large and heterogeneous infrastructure. It is provided by the University Computing Center as an OpenStack cloud and managed using both Terraform and the OpenStack API. Image builds and application deployments are automated using Jenkins and Ansible. User workloads are delivered to the HPC network using HTCondor and Pulsar. InfluxDB and Grafana let us keep an eye on everything, and Sentry watches for bugs within the platform.

  • Monitor availability, performance and errors. Incident response and analysis.
  • Manage the infrastructure ensuring reliability and scalability.
  • Upgraded the HTCondor job scheduler to the newest LTS release with zero downtime.
  • Implemented GitHub automations to enhance contributors' experience.
  • Contributed to Galaxy codebase, improving observability and fixing bugs.

Software developer at Fraunhofer IWM
Feb. 2021 - Mar. 2023, Freiburg

Our team was dedicated to developing software solutions that enable digitalization in the domain of materials science. The aim was to facilitate the publication not only of research outcomes, but also of the data generated during the process, together with its metadata and context in standard formats (FAIR data). The technology we used to realize that vision was knowledge graphs based on the W3C Semantic Web standards (RDF and the Web Ontology Language). In this context, I was mainly involved in the UrWerk, EOSC-Pillar, and SimDOME projects, as well as the development of SimPhoNy.

  • Led the design and implementation of an ontology search engine for the EOSC-Pillar project and developed the crawler component.
  • Developed ETL pipelines for the UrWerk project producing knowledge graphs depicting the manufacturing history of materials.
  • Maintained the SimPhoNy framework for interoperability based on knowledge graphs as part of the SimDOME project.
  • Conducted workshops on FAIR data, knowledge graphs and the Semantic Web.

Data engineering internship at Endesa (Enel Group)
Jan. 2020 - Jan. 2021, Seville

We worked as a team, following an agile software development methodology, on a project that used machine learning and deep learning algorithms to detect energy fraud. The work could be split into two parts: designing and training the models, and preparing the data needed to train them.

As we had to deal with huge volumes of data, we used technologies such as Apache Spark (via the Python API, PySpark) and Hadoop to load, normalize and process the raw data, transforming it into a dataset suitable for training the machine learning and deep learning models.

  • Designed and implemented ETL pipelines that periodically integrated new data on the company's 11 million customers into the team's data warehouse.
  • Computed training datasets and continuously improved their quality by cleaning data, engineering variables and analyzing false positives and false negatives.
  • Extracted insights from data to support decision-making.

Mathematical finance internship at Instituto IBT
Nov. 2016 - Feb. 2017, Seville

I collaborated on a research project on stock markets, using MATLAB to compute economic indicators and find patterns of interest.

Education

BEng in Computer Science
Universidad Nacional de Educación a Distancia
2019 - Now

As I come from a scientific background, I am pursuing this bachelor's degree with the aim of acquiring a comprehensive understanding of software engineering practices and delving into all of its aspects.

I study at UNED, a Spanish university focused on remote learning with 50 years of history. I can learn no matter where I am, and take examinations all around the world. As I started with a strong background, I usually pick the subjects I need the most rather than following the prescribed order. At the moment, I have completed 28% of the study program.

MSc in Applied Mathematics (MathMods)
Università degli Studi dell'Aquila
Universität Hamburg
2017 - 2019

Master's degree in mathematics with a focus on modelling for engineering applications. The first semester covers theory, the second numerical methods, and the third and fourth focus on a specific application.

For the third and fourth semesters, I chose to specialize in optimization and wrote a master's thesis on solving a problem in the field of operations research using constraint programming: "Vehicle Routing Problems in Airport Services Management." - read it here.

MathMods belonged to the Erasmus Mundus Joint Masters framework. Erasmus Mundus programs involve several institutions in different countries, and the students, who come from all over the world, move between them during the course of their studies.

BSc in Physics
Westfälische Wilhelms-Universität Münster
Universidad de Sevilla
2013 - 2017

Bachelor's degree in Physics. As I was attracted toward the applied side of science and computing, I focused on electromagnetism, electronics and signal processing. I wrote a bachelor’s thesis on side-channel attacks: "Electrical signal processing for the optimization of side-channel attacks on cryptographic circuits." - available on idUS (in Spanish).

International Baccalaureate Diploma Programme
IES Martínez Montañés
2011 - 2013

The International Baccalaureate Diploma Programme is an educational program for teenagers that is taught in 140 countries around the world. It provides a well-rounded education covering not just plain knowledge, but also critical thinking skills and social awareness.

Certifications

Name                                 Level  Date
Certificazione CILS Livello Tre      C1     2020
TestDaF                              B2     2016
Cambridge English Advanced Diploma   C1     2014