- Видео 11
- Просмотров 761
MaguvuTechHub
Зимбабве
Добавлен 12 фев 2023
Outliers detection and handling using Z-score in Jupyter Notebook
This video discusses how to detect outliers using box plots, scatter plots, and z-scores. It also emphasizes techniques for handling outliers as part of data scrubbing or cleaning, specifically through z-score methods. This content builds on the previous video about data scrubbing.
Thresholds are:
Z-score = 3: Data points with a z-score greater than 3 are typically considered outliers on the high end.
Z-score = -3: Data points with a z-score less than -3 are considered outliers on the low end.
Thresholds are:
Z-score = 3: Data points with a z-score greater than 3 are typically considered outliers on the high end.
Z-score = -3: Data points with a z-score less than -3 are considered outliers on the low end.
Просмотров: 20
Видео
Data Scrubbing in Python - Jupyter Notebook
Просмотров 289 часов назад
This video addresses the topic of data scrubbing, also referred to as data cleaning in Python. It emphasizes techniques for managing missing values, inconsistent data, and errors in data types or capture. Additionally, it covers the detection and handling of outliers using various methods, including IQR, box plots, and scatter plots. The video also aims to clarify the distinction between data s...
Exploratory Data Analysis in Python using Jupyter Notebook
Просмотров 3516 часов назад
This video provides an overview of Exploratory Data Analysis (EDA) in Python. It discusses the definition of EDA and the strategies that can be employed during the exploratory phase. Additionally, it highlights the significance of EDA in the context of machine learning prior to model development. The video includes guiding questions to consider while conducting EDA and concludes with a practica...
Introduction to Machine Learning
Просмотров 8419 часов назад
This video covers the fundamentals of machine learning, aiming to address several key questions: What is machine learning? What is big data? What are the characteristics of big data? What are the various machine learning techniques? And what constitutes the machine learning life cycle?
Installing packages in Python environment
Просмотров 1121 час назад
This video focuses on the installation of Python packages using Conda. We will explore how to list packages that are already installed within a specific environment, as well as how to install new packages in that environment.
Jupyter Notebook Walkthrough
Просмотров 2321 час назад
This video provides a comprehensive walkthrough of Jupyter Notebook utilizing JupyterLab. It covers essential topics including launching the environment, changing themes, enabling line numbering, interrupting or stopping the kernel, formatting headings and text, loading images, and creating list and table layouts in Markdown.
Setting out Jupyter Notebook Environment
Просмотров 1521 час назад
Introducing the Jupyter Notebook Environment using Anaconda Distribution on windows machine. Demonstrating how to create a new python environment using the anaconda prompt and installation of Jupyter lab. This video will cover how to activate new python environment, change working directory on the machine, installation of Jupyter lab using Conda management, and then finally launching Jupyter lab
Decision Tree Algorithm
Просмотров 91Год назад
Decision Tree using ID3 Algorithm with a numerical example. Implementing the Shannon Entropy and Information Gain to determine the root node or next node. You can support me through buying me a Rose: www.buymeacoffee.com/edlightm
Apriori Algorithm explained
Просмотров 158Год назад
A step-by-step explanation of the Apriori Algorithm using a numeric example. Making use of the Associations Rules metrics; Support, Confidence and Lift. You can support me through buying me a Rose: www.buymeacoffee.com/edlightm
Association Rule Mining Explained
Просмотров 74Год назад
An explanation of how the Association rules are generated using Support, Confidence, and Lift metrics making using an example. You can support me through buying me a Rose: www.buymeacoffee.com/edlightm
PageRank Algorithm Explained
Просмотров 229Год назад
Explanation of how the PageRank algorithm works using the Random Surfer Model with a given numerical example. You can support me through buying me a Rose: www.buymeacoffee.com/edlightm
Link to DataSet used from Kaggle www.kaggle.com/datasets/fedesoriano/stroke-prediction-dataset
thank you very much for this explanation
Glad it was helpful!