- Видео 426
- Просмотров 612 026
The Data Guy
США
Добавлен 6 апр 2013
Your one stop shop for all your Data needs! Have a hard problem you'd like solved but don't know how? Send it to me at gyatesofficial@gmail.com and I'll make a video on it!
www.linkedin.com/in/george-yates/
www.linkedin.com/in/george-yates/
All Apache Data Formats Explained! Apache Feather Vs. Avro Vs. ORC V. Parquet!
In this video, I'll run through all the most popular data format Apache projects so you can choose the best option for your use case!
Просмотров: 52
Видео
How to Build an ELT Pipeline with Postgres, Apache Airflow, and dbt with Cosmos!
Просмотров 97День назад
In this video, I'll show you how you can build an ELT pipeline that loads data from an API into a postgres database, and then uses dbt to transform it within the Postgres database.
How to Choose the Right Tools for Your Data Tech Stack! Data Tools Decision Making Guidelines!
Просмотров 2382 дня назад
In this video, I'll give you a series of guidelines you can use to choose new data products! This will cover things like servers vs. serverless, modular vs monolith, and more! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Build a Reverse ETL Pipeline with Airflow, Snowflake, and Salesforce!
Просмотров 20914 дней назад
In this video, I'll show you how you can use Airflow to build a reverse ETL pipeline with Airflow, Snowflake, and Salesforce to take customer data out of Salesforce, store it in a master customer database, enrich the data, and then sync it back into Salesforce so you have the most relevant data for each customer! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Get Started with LakeSail & PySail for Spark! Spark Compute Framework Lakesail Explained!
Просмотров 15914 дней назад
In this video, I'll teach you the basics of pysail, how it can help your spark workloads more efficiently, and how you can get started using it! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Become a Sales Engineer! Sales Engineering Explained!
Просмотров 14414 дней назад
In this video, I'll explain what Sales Engineering is, what Sales Engineers do, and the skills you'll need to learn to become one! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Use FlinkML and MLLib for ML Model Training and Retraining!
Просмотров 18914 дней назад
In this video, I'll show you how you can use FlinkML and MLLib for Machine learning model training and consistent retraining! Link to github repo is below: github.com/apache/flink-ml
How to Clean Your Data! Data Cleaning Techniques and Examples for Beginners!
Просмотров 424Месяц назад
In this video, I'll teach you all of the basics of how to clean your data so it's ready for any downstream use cases! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
Beginner's Guide to Apache Ray! Apache Ray Explained
Просмотров 362Месяц назад
In this video, I'll teach you everything you need to know about Apache Ray! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
Apache Iceberg Vs. Delta Lake Vs. Apache Hudi! Data Lake Storage Solutions Compared!
Просмотров 656Месяц назад
In this video, I'll break down the pros and cons of 3 of the most popular Data Lake storage solutions out there on the market, Apache Iceberg, Databricks Delta Lake and Apache Hudi!
Data Scientist Zero to Hero Guide! Everything You Need to Learn to Get a Job in Data Science!
Просмотров 218Месяц назад
In this video, I'll give you a roadmap of everything you need to know to become a Data Scientist! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Build an ELT Pipeline with Google BigQuery, Apache Airflow, and dbt!
Просмотров 896Месяц назад
In this video, I'll show you how you can build an ELT pipeline where we'll pull data from an api, store it in Google cloud storage, then upload it into BigQuery and use dbt with Cosmos to transform our data in BigQuery! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
RabbitMQ Vs. Apache Kafka! RabbitMQ and Apache Kafka Explained, Compared and Contrasted!
Просмотров 1,2 тыс.Месяц назад
In this video, I'll explain the differences and best use cases for RabbitMQ and Apache Kafka! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Run Java Applications with Apache Airflow! Learn to Trigger & Monitor Remote VMs from Airflow
Просмотров 134Месяц назад
In this video, I'll teach you how you can trigger and monitor java applications on a remote VM from Airflow! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Create a Medallion Data Architecture! Medallion Architecture Guide & Best Practices!
Просмотров 464Месяц назад
In this video, I'll teach you everything you need to know about what a Medallion data architecture is, and how you can set one up yourself to production grade standards! Join My Discord for Any Questions or Code: discord.gg/JkjvyYmFcx
How to Run Apache Airflow in Production! Best Practices for Running Apache Airflow at Scale!
Просмотров 1 тыс.2 месяца назад
How to Run Apache Airflow in Production! Best Practices for Running Apache Airflow at Scale!
Snowflake Vs. AWS RedShift Vs. GCP BigQuery Vs. Azure Synapse for Data Warehousing!
Просмотров 9682 месяца назад
Snowflake Vs. AWS RedShift Vs. GCP BigQuery Vs. Azure Synapse for Data Warehousing!
How to Run Talend Tasks Using Apache Airflow and Create a Talend Operator!
Просмотров 1442 месяца назад
How to Run Talend Tasks Using Apache Airflow and Create a Talend Operator!
How to Collect and Visualize Lineage Data from your Data Pipelines with Apache Airflow!
Просмотров 4122 месяца назад
How to Collect and Visualize Lineage Data from your Data Pipelines with Apache Airflow!
How to Set Up a Data Lake in Production! Data Lake Best Practices Guide
Просмотров 5022 месяца назад
How to Set Up a Data Lake in Production! Data Lake Best Practices Guide
How to Use Polars, the Modern Pandas Alternative! Getting Started with Polars for Python!
Просмотров 4152 месяца назад
How to Use Polars, the Modern Pandas Alternative! Getting Started with Polars for Python!
How to Build a Production ML Pipeline with Apache Airflow, Databricks, Kafka, and MLFlow!
Просмотров 5812 месяца назад
How to Build a Production ML Pipeline with Apache Airflow, Databricks, Kafka, and MLFlow!
How to Use Apache Flink and Apache Kafka to Do Real Time Stream Processing!
Просмотров 9962 месяца назад
How to Use Apache Flink and Apache Kafka to Do Real Time Stream Processing!
How to Develop Spark Scripts Locally Before Deploying Them to a Databricks Cluster!
Просмотров 4592 месяца назад
How to Develop Spark Scripts Locally Before Deploying Them to a Databricks Cluster!
How to Build an ETL Pipeline with Airbyte, Apache Airflow, and Snowflake!
Просмотров 4752 месяца назад
How to Build an ETL Pipeline with Airbyte, Apache Airflow, and Snowflake!
How to Use AWS Lambda and Apache Airflow to Create an ETL and Machine Learning Pipeline!
Просмотров 3402 месяца назад
How to Use AWS Lambda and Apache Airflow to Create an ETL and Machine Learning Pipeline!
How to Build an ELT Pipeline with AWS Redshift, Apache Airflow and dbt!
Просмотров 1,3 тыс.3 месяца назад
How to Build an ELT Pipeline with AWS Redshift, Apache Airflow and dbt!
dbt Core Vs. SQLMesh for SQL Transformations!
Просмотров 1,3 тыс.3 месяца назад
dbt Core Vs. SQLMesh for SQL Transformations!
How to Build Auto-Refreshing Analytics Pipelines with Microsoft SQL Server, PowerBI & Apache Airflow
Просмотров 4643 месяца назад
How to Build Auto-Refreshing Analytics Pipelines with Microsoft SQL Server, PowerBI & Apache Airflow