About me
Hey👋 my name is Luis; I love working with data and learning about new techniques to store it 📖 , query it 🔍 and gather insights 💡 I am interested in building effective and reliable data pipelines, deep-diving into analyses, and asserting that data is sound and reproducible. For my tasks, I mainly work with Python 🐍 (although R is becoming a staple in the toolbox 🧰 ) using libraries such as Scikit, pandas, and Numpy, but I'm always excited to take a look at new tools and tech. When it comes to storing and querying data, I mainly have experience with relational databases like Postgres and MySQL, and data warehouses like BigQuery or Snowflake. In ETL pipelines, I've worked with Airbyte and DBT with a focus on designing incremental models⏳ Currently, I am also exploring tools like Dagster, dlt and SQLMesh for building a stack aimed towards incremental modeling with frequent updates, where timing is an important factor. I am always happy to connect with anyone and explore mutual opportunities. Looking forward to connecting!
Tools and technologies
Experience
Prodege, LLC
Associate Business Intelligence Engineer
Joined the Prodege BI team after BitBurst was fully merged with Prodege.
Migrated legacy SQL models running on Autosys to dbt models, optimising for incremental transformations and incorporating deduplication.
Designed feature models for the feature layer, consolidating feature definitions and advocating for feature reuse to reduce computation spend.
Introduced CI/CD pipelines to deploy Airflow DAGs as well as mirror MWAA dependencies with the repo's dependency manager.
Designed the dev workflow for dbt core, including CI/CD and pre-commit hooks to enforce standards (SQLfluff, dbt-bouncer, custom hooks/tests) and updated dbt YML schemas.
Developed a tool to import source tables as dbt staging models, mirror source DB constraints as dbt tests to catch CDC data corruption, detect PII (using Presidio) and write accurate source/table/model/column descriptions.
BitBurst GmbH
Associate Business Intelligence Enginner
Optimized models with a star schema, maintaining backward compatibility via views for seamless user experience.
Introduced CI/CD pipelines to ensure code quality, model testing and seamless deployment of changes.
Migrated the ELT pipeline from Airbyte and dbt Cloud to Dagster, dltHub and SQLMesh.
BI Data Analytics
Developed robust ELT using Airbyte and DBT, significantly improving data quality.
Implemented version control for DBT models and ELT processes with Git, enhancing collaboration.
Analyzed internal metrics to identify fraud patterns, increasing detection accuracy with new statistical rules.
Created interactive dashboards with Metabase/Sigma, facilitating data-driven decision-making across teams.
Allianz Kunde und Markt GmbH
Working Student
Supported team in gathering customer satisfaction insights and developing internal dashboards for visualization.
Conducted data preprocessing, knowledge mining, statistical analyses, and communicated insights to stakeholders.