Sean Law - Modern Time Series Analysis with STUMPY - Intro To Matrix Profiles | PyData Global 2020

Поделиться
HTML-код
  • Опубликовано: 27 окт 2024

Комментарии • 13

  • @TerritOrially
    @TerritOrially 7 месяцев назад

    🎯 Key Takeaways for quick navigation:
    00:04 *🌐 Introduction to STUMPY and Modern Time Series Analysis*
    - Introduction by Sean Law, the creator and core developer of STUMPY, highlighting the agenda of the presentation on modern time series analysis using STUMPY.
    - Sean Law thanks the PyData Global organizers and TD Ameritrade, his employer.
    - Challenges in time series data analysis are outlined, emphasizing the difficulty in visualizing and analyzing large datasets.
    02:08 *📊 Common Approaches in Time Series Analysis*
    - Overview of traditional methods in time series analysis, including visualization, statistics, auto-regressive models, anomaly detection, forecasting, machine learning models, unsupervised learning, dynamic time warping, and deep learning.
    - Each method's limitations and the necessity for a scalable and intuitive solution are discussed.
    04:01 *📘 Illustrative Example of Time Series Analysis*
    - Introduction to basic concepts in time series analysis, such as subsequences, and the goal of identifying conserved behaviors or patterns.
    - The presentation outlines the importance of a simple and intuitive approach that is easy to interpret, user and data agnostic, requires no prior knowledge, and is (nearly) parameter-free.
    06:50 *🔍 Comparing Subsequences Using Euclidean Distance*
    - Explanation of how to compare subsequences in a time series using Euclidean distance.
    - The process of computing a distance matrix for time series data is detailed, illustrating the computational challenge it presents for large datasets.
    09:35 *🧮 Introduction to the Matrix Profile Concept*
    - Introduction to the matrix profile concept, a transformative approach that simplifies the analysis of time series data by focusing on the nearest neighbors of subsequences.
    - The matrix profile's ability to identify motifs (repeated patterns) and discords (anomalies) in time series data is highlighted.
    13:47 *📈 Using the Matrix Profile for Analysis*
    - Practical applications of the matrix profile in identifying conserved behaviors, motifs, and potential anomalies within time series data.
    - The significance of the matrix profile index and how it aids in locating the nearest neighbors for subsequences is discussed.
    16:34 *💻 Development and Impact of STUMPY*
    - The evolution of algorithms to efficiently compute the matrix profile, highlighting the contributions of the STAMP, STOMP, and GPU-accelerated STOMP algorithms.
    - Introduction of STUMPY, a Python library developed to validate and implement matrix profile algorithms for efficient time series analysis.
    17:59 *🚀 STUMPY: A Powerful Tool for Time Series Analysis*
    - STUMPY is highlighted as a scalable Python library for computing the matrix profile, enabling efficient time series analysis without reinventing the wheel.
    - STUMPY's growth and community support: over 50,000 downloads, 1,500 GitHub stars, and current version 1.5.
    - Features and capabilities: minimal dependencies, compatibility with modern Python, parallelization across server cores, Dask cluster support for distribution across multiple servers, and performance benchmarks (256 CPUs across 32 servers computed a 100 million data point sample matrix profile in under 10 days).
    21:20 *📊 Live Demo: Analyzing Time Series with STUMPY*
    - A live demonstration showcases STUMPY's ability to identify patterns and anomalies in time series data through the computation of matrix profiles.
    - Detailed examination of a specific pattern within the time series, its repetition, and the process of identifying its nearest neighbor.
    - The demo highlights the matrix profile's role in detecting motifs (conserved patterns or behaviors) and discords (potential anomalies) within the time series, illustrating how these insights guide where to focus analysis.
    26:39 *📘 STUMPY Documentation and Resources*
    - Overview of STUMPY's extensive documentation, tutorials, and API, which support users in efficiently utilizing the library for time series analysis.
    - The versatility of STUMPY is showcased, including its application to multi-dimensional matrix profiles, time series chains, semantic segmentation, and the comparison and clustering of time series data.
    - Encouragement for community contribution, communication about STUMPY, and utilization of the library, alongside links to tutorials, live demos, and open-source code repositories.
    Made with HARPA AI

  • @lu.quit.a
    @lu.quit.a 2 года назад +3

    This was awesome, very good introduction to Matrix Profiles!

  • @MrStephcaster
    @MrStephcaster 9 месяцев назад

    awesome! I can't wait to test it

  • @jefferyanderson
    @jefferyanderson 2 года назад +2

    Excellent presentation Sean

  • @abolfazlzeraatkar2425
    @abolfazlzeraatkar2425 10 месяцев назад

    very good and really ease to learn, thanks

  • @stephanembatchou5300
    @stephanembatchou5300 2 года назад +2

    That was a great presentation.

  • @SreeramAjay
    @SreeramAjay Год назад +1

    Very nice explanation, thank you

  • @qdhfhwiwiir
    @qdhfhwiwiir 7 месяцев назад

    Good work. But how can it be combined with other methods (clustering,ml, arima etc.) I thought this is indivisual method that parallel to those methods.

  • @yeongnamtan
    @yeongnamtan Год назад

    that was a very interesting presentation. Would you be able to share your code so we could play around with stumpy ?

  • @gianmarcosalvi339
    @gianmarcosalvi339 Год назад

    Is it possible to apply AB join to timeseries with different granularity data? i.e. T_a has data sampled every 200ms and T_b has data sampled every 10s

    • @21jvg2
      @21jvg2 7 месяцев назад

      I think you can try to generate some noice between, or use dynamic warping to match

  • @RyuuOujiXS
    @RyuuOujiXS 9 месяцев назад

    subsequence in series is how you can tell someone is stupid. It should be subseries.

    • @maxgonchar6080
      @maxgonchar6080 2 месяца назад

      subsequence of datapoints sounds perfect to me