Skip to content
View AndreaBozzo's full-sized avatar
:octocat:
:octocat:

Block or report AndreaBozzo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AndreaBozzo/README.md

πŸ‘‹ Andrea Bozzo

Senior Data Architect | Systems Programmer | Open Source Creator

My path into technology wasn't a straight line, but an inevitable evolution. I've spent my life as a problem-solver, and with a keyboard in front of me for as long as I can remember, programming became my natural language for building solutions.

This journey took me from a background as a Chartered Accountant, through data analysis, and ultimately deep into systems engineering. On my blog, I share technical deep-dives into the projects I contribute to.

Expect complex jargon and unfiltered details as it's written for fellow nerds builders.

I have a few personal projects like Ceres, free templates for BI, hands on starter implementations of Kafka, Flink, PostgreSQL / DuckDB, analytics notebooks and generally useful content for anyone working with data. Leave a ⭐ if you find any of these useful, they are all on free licences.

Thank you ❀️

🌐 Landing Page β€’ πŸ“ Blog β€’ πŸ’Ό LinkedIn β€’ πŸ“§ Email

⚠️ Maintenance Notice: The blog may be temporarily unavailable between December 31st, 2025 and January 2nd, 2026 due to scheduled maintenance. Thank you for your patience!

profile views


πŸ› οΈ Tech Stack

Core Languages Rust Python
Data & Architecture Data Lakehouse Apache Arrow PostgreSQL
Cloud & DevOps Azure Docker GitHub Actions


🌟 Open Source Contributions

Contributing to the broader open source ecosystem beyond my own projects.

This section is automatically updated daily via GitHub Actions

  • pola-rs/polars ⭐ 36684 - 2 merged PRs
    • Extremely fast Query Engine for DataFrames, written in Rust
  • risingwavelabs/risingwave ⭐ 8633 - 1 merged PR
    • Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.
  • supabase/etl ⭐ 2125 - 2 merged PRs
    • Stream your Postgres data anywhere in real-time. Simple Rust building blocks for change data capture (CDC) pipelines.
  • datapizza-labs/datapizza-ai ⭐ 2048 - 3 merged PRs
    • Build reliable Gen AI solutions without overhead πŸ•
  • mariocandela/beelzebub ⭐ 1772 - 1 merged PR
    • A secure low code honeypot framework, leveraging AI for System Virtualization.
  • apache/iceberg-rust ⭐ 1176 - 2 merged PRs
    • Apache Iceberg
  • lakekeeper/lakekeeper ⭐ 1108 - 2 merged PRs
    • Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
  • italia-opensource/awesome-italia-opensource ⭐ 313 - 1 merged PR
    • Italian Open-Source is the first platform dedicated to Italian open-source world
  • mosaico-labs/mosaico ⭐ 252 - 7 merged PRs
    • Mosaico is the open-source data platform for Robotics and Physical AI
  • pganalyze/pg_query.rs ⭐ 209 - 1 merged PR
    • Parse, deparse and normalize SQL queries using the Postgres source code

πŸ“Š GitHub Stats

GitHub Stats Top Languages GitHub Streak

Contribution Graph

Pinned Loading

  1. Ceres Ceres Public

    Semantic search engine for open data portals built with tokio and pgvector.

    Rust 3

  2. mosaico-labs/mosaico mosaico-labs/mosaico Public

    Mosaico is the open-source data platform for Robotics and Physical AI

    Python 252 11

  3. dataprof dataprof Public

    Fast, reliable data quality assessment for CSV, Parquet, and databases

    Rust 8 1

  4. rust-ita/rust-docs-it rust-ita/rust-docs-it Public

    Documentazione Rust tradotta in italiano

    Shell 2 1

  5. credit-risk-lakehouse credit-risk-lakehouse Public

    Data pipeline for credit risk analytics

    Jupyter Notebook

  6. dce dce Public

    Data Contracts Engine for modern Data platforms. Define, validate, and enforce data quality contracts across multiple formats and cloud providers.

    Rust