End-to-End Data Analytics Project: SQL, Python-ETL & Power BI in FinTech

Поделиться
HTML-код
  • Опубликовано: 6 фев 2025
  • End-to-End FinTech Data Analytics Project:
    📊 𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝗢𝘃𝗲𝗿𝘃𝗶𝗲𝘄:
    This project focused on analyzing transactional data to extract actionable insights using:
    Source Data→ CSV files & SQL queries
    Database→ MySQL, managed via DBeaver
    ETL Pipeline→ Built and automated with Python
    Data Analysis→ Performed with SQL
    Visualization→ Insights showcased in Power BI
    🔑 Key KPIs:
    1️⃣ Total unique customers
    2️⃣ Month-wise transaction analysis
    3️⃣ Regional and area-wise customer distribution
    4️⃣ Transaction type analysis with balances, percentages
    5️⃣ Monthly active customers (MAC)
    6️⃣ Customer-wise and month-end closing balances
    7️⃣ Latest 5-day deposits & withdrawals
    8️⃣ 5-day interval transaction analysis
    9️⃣ Weekly comparison of deposits & withdrawals
    🔟 15-day forecast for deposits & withdrawals
    ✨ 𝐎𝐮𝐭𝐜𝐨𝐦𝐞: Delivered a robust analytics solution for FinTech decision-makers to monitor performance, identify trends, and forecast effectively.
    📌 𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝗣𝗿𝗼𝗴𝗿𝗲𝘀𝘀 :
    🎯 Phase 1: 𝗗𝗮𝘁𝗮𝗯𝗮𝘀𝗲 𝗦𝗲𝘁𝘂𝗽 𝗮𝗻𝗱 𝗘𝗧𝗟 𝗣𝗿𝗼𝗰𝗲𝘀𝘀
    1. Created two databases in MySQL:
    → LIVE Database
    → WH Database
    2. Performed the following data operations:
    → Inserted data for the 𝗥𝗲𝗴𝗶𝗼𝗻 and 𝗔𝗿𝗲𝗮 tables in the LIVE database, load from the LIVE database to the WH database using full truncate.
    → Imported the 𝗰𝘂𝘀𝘁𝗼𝗺𝗲𝗿𝘀_𝘁𝗿𝗮𝗻𝘀𝗮𝗰𝘁𝗶𝗼𝗻 table into the LIVE database, then loaded it incrementally into the WH database.
    → Extracted the 𝗰𝘂𝘀𝘁𝗼𝗺𝗲𝗿_𝗷𝗼𝗶𝗻𝗶𝗻𝗴_𝗶𝗻𝗳𝗼 table directly from a drive and loaded it into the WH database with full truncate ETL process using Python.
    🎯 Phase 2: 𝗦𝗼𝗹𝘃𝗶𝗻𝗴 𝗞𝗣𝗜𝘀 𝘄𝗶𝘁𝗵 𝗦𝗤𝗟
    Used SQL to calculate all KPIs by employing a variety of techniques, including:
    → Different statements, operators, and functions
    → Aggregated functions and GROUP BY clauses
    → Various types of joins
    → Window functions
    → Sub-queries and Common Table Expressions (CTEs)
    🎯 Phase 3: 𝗟𝗼𝗮𝗱𝗶𝗻𝗴 𝗗𝗮𝘁𝗮 𝘁𝗼 𝗣𝗼𝘄𝗲𝗿 𝗕𝗜
    Imported datasets from MySQL databases into Power BI using the ‘localhost' Server and ‘wh_online_banking’ database via SQL statements.
    🎯Phase 4: 𝗗𝗮𝘁𝗮 𝗠𝗼𝗱𝗲𝗹𝗶𝗻𝗴 𝗮𝗻𝗱 𝗩𝗶𝘀𝘂𝗮𝗹𝗶𝘇𝗮𝘁𝗶𝗼𝗻 𝗶𝗻 𝗣𝗼𝘄𝗲𝗿 𝗕𝗜
    1. 𝗗𝗮𝘁𝗮 𝗠𝗼𝗱𝗲𝗹𝗶𝗻𝗴:
    → Created required measures, calculated columns, and custom columns in Power Query using M-Language.
    2. 𝗩𝗶𝘀𝘂𝗮𝗹𝗶𝘇𝗮𝘁𝗶𝗼𝗻𝘀:
    → Chose appropriate visuals for better representation:
    Card, Card (New), Area Chart, Bar Chart, Column Chart, Donut Chart, and Matrix (Table)
    → Added tooltips, drill-throughs, and page navigation for detailed insights.
    → Used slicers, tile slicers, bookmarks, and conditional formatting for dynamic visualization.
    3. 𝗧𝗶𝗺𝗲 𝗦𝗲𝗿𝗶𝗲𝘀 𝗔𝗻𝗮𝗹𝘆𝘀𝗶𝘀:
    Used Line Chart with forecasting advanced functions.
    🎯 Phase 5: 𝗗𝗲𝗽𝗹𝗼𝘆𝗺𝗲𝗻𝘁
    → Developed an interactive dashboard in Power BI.
    → Published it on the Web Service for live browsing.
    𝐂𝐨𝐧𝐜𝐥𝐮𝐬𝐢𝐨𝐧:
    To improve the system, negative balances should be addressed as they are entirely forbidden and unexpected in financial operations. Controlling this issue is critical for maintaining data integrity.
    --------------------------------------------------------------------------------------
    #DataAnalytics #SQL #Python #ETL #PowerBI #FinTechAnalytics #ETLPipeline #MySQL #DataVisualization #BusinessIntelligence #PowerBIDashboard #KPIs #DataModeling #TimeSeriesAnalysis #InteractiveDashboard #DataIntegrity #FinancialAnalytics #TransactionalData #Forecasting #DataPipeline #DAX

Комментарии • 2