This repository provides a comprehensive set of tutorials to help you master Snowflake, the cloud data warehousing platform. Each section covers key concepts and practical examples to accelerate your learning.
- What is Snowflake?
- Snowflake’s three-layer cloud architecture (storage, compute, cloud services)
- Key components: Databases, schemas, warehouses, roles
- Data types, schemas, tables
- Basic DML: SELECT, INSERT, UPDATE, DELETE
- Filtering, sorting, aggregations, joins, subqueries
- Window functions, CTEs, recursive queries
- Transactions and concurrency control
- Query optimization and tuning
- COPY INTO command
- External and internal stages
- File formats
- Roles and privileges
- Data masking and row-level security
- What are stored procedures?
- Writing procedures in SQL and JavaScript
- Procedure control flow: variables, loops, conditionals
- Calling, managing, and error handling in procedures
- Introduction to JavaScript usage
- Writing JavaScript UDFs and procedures
- Using JavaScript API objects (e.g.,
Snowflake.createStatement()) - Integration with SQL commands
- Debugging and best practices
- Introduction to Snowpark
- What is Snowpark and how it integrates with Snowflake?
- Snowpark APIs overview (Java, Scala, Python)
- Installation and configuration
- Authentication and connection to Snowflake
- DataFrames and Dataset APIs
- Transformations and actions
- Creating and using UDFs in Snowpark
- Writing stored procedures with Snowpark
- Working with semi-structured data (JSON, XML)
- Integrating external libraries
- Optimization best practices
- ETL pipelines
- Machine learning workflows
- Streaming data processing
Feel free to explore the tutorials and contribute!