Giulio Piccolo - Lead Data Engineer, professional headshot

Giulio Piccolo

Lead Data Engineer

Results-oriented Data Engineer with a proven track record in designing and optimizing large-scale data solutions that drive strategic business outcomes.

About Me

Results-oriented Data Engineer with a proven track record in designing and optimizing large-scale data solutions that drive strategic business outcomes. Drawing on my background as a former competitive athlete, I bring tenacity, teamwork, and discipline to every project.

My end-to-end expertise in data engineering and warehousing underpins real-time decision-making and enhances operational efficiency. As a confident communicator, I excel at collaborating with cross-functional teams and stakeholders, consistently delivering robust, on-time, and cost-effective data products that fuel sustained growth, speed, and innovation in business processes.

Professional Experience

Tech Lead Data Platform
Suitsupply - Amsterdam, NL
Nov '24 - Present
Leading the design and implementation of scalable backend services and real-time data pipelines for processing web events and analytics, resulting in significant performance improvements.
  • Designed and implemented scalable backend services and real-time data pipelines using Airbyte, Python, dbt, Datastream, and Kubernetes resulting in a 150% increase in pipeline delivery time
  • Collaborated with Product Managers, Data Scientists, and Marketing Specialists to architect comprehensive end-to-end ETL solutions, reducing data latency by 50% and enhancing system reliability by 30%
  • Led code reviews and mentored junior engineers during Learning Thursday, enforcing coding standards and optimizing complex data ingestion frameworks
Key Technologies:
Python Airbyte dbt Datastream Kubernetes ETL
Lead Data Engineer
Road - Amsterdam, NL
Oct '23 - Oct '24
Led the successful implementation of a robust data platform, creating both the Data Lake and Data Warehouse while significantly enhancing performance and data accuracy.
  • Led the successful implementation of a robust data platform comprising Python, Spark, dbt, BigQuery, Airflow, Kafka, and Looker, effectively creating both the Data Lake and Data Warehouse
  • Significantly enhanced the ELT process by implementing advanced Python techniques such as concurrent code execution and task parallelization, resulting in a remarkable 300% increase in computation speed
  • Spearheaded collaboration between data analysts and software engineers achieving a 40% improvement in data accuracy and a 90% reduction in data anomalies
Key Technologies:
Python Apache Spark dbt BigQuery Airflow Kafka Looker
Data Engineer
Slido (part of Cisco) - Remote
Apr '22 - Oct '23
Implemented data pipelines for ingesting and transforming large volumes of data, designed scalable infrastructure, and established best practices for data quality and reliability.
  • Implemented data pipelines for ingesting and transforming large volumes of data, resulting in a 35% reduction in errors during the ELT process using Python, Airflow, PySpark, and AWS
  • Designed and maintained a scalable data infrastructure on AWS using Apache Kafka, AWS Kinesis and Clickhouse allowing for seamless integration of new data sources and reducing downtime by 50%
  • Collaborated with cross-functional teams to establish best practices for software design patterns and CI/CD, leading to a 4 hours/week reduction in time spent on manual testing
  • Implemented data quality & reliability tests for both technical and non-technical stakeholders, while advising other teams on data analytics tools
Key Technologies:
Python Airflow PySpark AWS Kafka Kinesis Clickhouse S3 Redshift
Data Analyst
Cisco Systems - Remote
Jan '21 - Mar '22
Involved in multivariate segmentation and market intelligence projects, working with large customer databases and applying machine learning techniques for business optimization.
  • Involved in multivariate segmentation starting from a database of over 300k customers, applying unsupervised machine learning algorithms (k-means++) and dimensional reduction techniques (PCA)
  • Extracted data with SQL and inspected them using Tableau to identify sales force's improvement areas in Southern Europe, leading to optimization in both sales distribution and accounts priority
  • Collaborated in investigating market intelligence related projects (modelling of data related to macro-economy, Total Addressable Market (TAM) and Market Shares)
Key Technologies:
SQL Tableau Python Machine Learning k-means PCA
Junior Data Analyst
SLCD - Legal & Corporate Advisors - Italy
Jun '19 - Present
Drove data solutions in a versatile legal boutique by designing data visualizations and conducting data analysis to support business strategy beyond traditional legal scopes.
  • Drove data solutions in a versatile legal boutique by designing and developing data visualizations using Tableau to support business strategy, enabling tailored problem-solving beyond traditional legal scopes
  • Conducted data cleaning and preprocessing tasks to ensure high data quality and integrity, reducing errors by 20%, and built seamless data infrastructures for effective collaboration
  • Collaborated with senior analysts to develop predictive models using Python and scikit-learn, delivering actionable insights with advanced analytics and machine learning
Key Technologies:
Tableau Python scikit-learn Data Visualization Machine Learning

Open Source Contributions

Sliger

Pythonizing Google Slides for automated presentation generation

Sliger saves time, money and mental sanity by Pythonizing Google Slides for people whose day-to-day job it is to present Slides from templates manually editing the repetitive fields. This project was showcased at PyCon Italy 2023 and PyCon UK 2023.

Key Features:

  • Uses Jinja2 templates integrated into Google Slides for dynamic content generation
  • Converts Google Slides to plaintext leveraging Google Cloud API requests for seamless automation
  • Streamlines presentation creation workflows reducing manual editing time significantly
  • Featured at major Python conferences demonstrating real-world impact and adoption
1.7k+ Downloads
PyCon Presenter
PyPI Published

Technologies Used:

Python Google Cloud API Jinja2 Typer Streamlit Google Slides

Get In Touch

I'm always interested in discussing new opportunities, collaborating on interesting projects, or simply connecting with fellow data professionals.

Email

giulio [dot] piccolo [at] me [dot] com