I am software developer with over 4 years of experience passionate about building performant and user-centered software solutions that bring value. Focused on data engineering, backend development, and DevOps.
Skills
Data
Python Semantic web Apache Airflow SQL databases MongoDB Apache Spark Pandas LaTeX Big data Seaborn Matplotlib PostgreSQL Hadoop Apache Hive SQLite
Mathematics
Management
Agile Methodology
Experience
Site Reliability Engineer at Albert-Ludwigs-Universität Freiburg
Galaxy is an open-source data analysis platform focused on bioinformatics and powered by a high-performance computing network. It is driven by a global community of developers, researchers and infrastructure engineers. In the Freiburg Galaxy team, we are three site reliability engineers and our job is to ensure that usegalaxy.eu works reliably.
Due to the scale of our platform (more than 90.000 registered users and up to 800 simultaneous users) and the variety of user workloads, we have a large and heterogeneous infrastructure, provided by the University Computing Center as an OpenStack cloud and managed both using Terraform and the OpenStack API. Image builds and application deployments are automated using Jenkins and Ansible. User workloads are delivered to the HPC network using HTCondor and Pulsar. InfluxDB and Grafana let us keep a check on everything, and Sentry watches for bugs within the platform.
- Monitor availability, performance and errors. Incident response and analysis.
- Manage the infrastructure ensuring reliability and scalability.
- Upgraded job scheduler HTCondor to newest LTS release with zero downtime.
- Implemented GitHub automations to enhance contributors' experience.
- Contributed to Galaxy codebase, improving observability and fixing bugs.
Software developer at Fraunhofer IWM
Our team was dedicated to the development of software solutions enabling digitalization in the domain of materials science. The aim was to facilitate the publication not only of research outcomes, but also of the data generated during the process, together with its metadata and context in standard formats (FAIR data). The technology that we used to realize that vision were knowledge graphs based on the W3C Semantic Web standards (RDF and the Web Ontology Language). In this context, I was mainly involved in the UrWerk, EOSC-Pillar, and SimDOME projects, as well as the development of SimPhoNy.
- Led the design and implementation of an ontology search engine for the EOSC-Pillar project and developed the crawler component.
- Developed ETL pipelines for the UrWerk project producing knowledge graphs depicting the manufacturing history of materials.
- Maintained the SimPhoNy framework for interoperability based on knowledge graphs as part of the SimDOME project.
- Conducted workshops on FAIR data, knowledge graphs and the Semantic Web.
Data engineering internship at Endesa (Enel Group)
We worked together as a team, using an agile software development methodology, on a project in which we used machine learning and deep learning algorithms with the purpose of detecting energy fraud. The necessary work to accomplish such a task could be split in two parts: the design and training of the models and the preparation of the data that they needed to be trained.
As we had to deal with huge volumes of data, we made use of technologies such as Apache Spark (via the Python API, PySpark) and Hadoop to carry out the process of loading, normalizing and processing the raw data that we possessed, transforming them in a dataset that was adequate for the training of the machine learning and deep learning Models.
- Desing and implement ETL pipelines that periodically integrate new data on the company's 11 million customers into the team's data warehouse.
- Compute training datasets and continuously improve their quality by cleaning data, engineering variables and analyzing false positives and false negatives.
- Extract insights from data to support decision-making.
Mathematical finance internship at Instituto IBT
Collaboration in a research project on stock markets. I used MATLAB to compute economic indicators and find patterns of interest.
Education
BEng in Computer Science
As I come from a scientific background, I am pursuing this bachelor's degree with the aim of acquiring a comprehensive understanding of software engineering practices and delve into all of its aspects.
I study at UNED, a Spanish university focused on remote learning with 50 years of history. I can learn no matter where I am, and take examinations all around the world. As started with a strong background, rather than following the prescribed order, I usually pick the subjects I need the most. At the moment, I have completed 28% of the study program.
MSc in Applied Mathematics (MathMods)
Master's degree in mathematics with a focus on modelling for engineering applications. The first semester covers theory, the second numerical methods, and the third and fourth focus on a specific application.
For the third and fourth semesters, I chose to specialize in optimization and wrote a master thesis on solving a problem in the field of operations research using constraint programming: "Vehicle Routing Problems in Airport Services Management." - read it here.
MathMods belonged to the Erasmus Mundus Joint Masters framework. Erasmus Mundus programs involve several institutions from different countries, across which the students, that come from all over the world, move during the course of their studies.
BSc in Physics
Bachelor's degree in Physics. As I was attracted toward the applied side of science and computing, I focused on electromagnetism, electronics and signal processing. I wrote a bachelor’s thesis on side-channel attacks: "Electrical signal processing for the optimization of side-channel attacks on cryptographic circuits." - available on idUS (in Spanish).
International Baccalaureate Diploma Programme
2011 - 2013
The International Baccalaureate Diploma Programme is an educational program targeting teenagers taught in 140 countries around the world. It provides a well-rounded education covering not just plain knowledge, but also critical thinking skills and social awareness.
Certifications
Name | Level | Date |
---|---|---|
Certificazione CILS Livello Tre | C1 | 2020 |
TestDAF | B2 | 2016 |
Cambridge English Advanced Diploma | C1 | 2014 |