ML System Design Mock Interview - Build an ML System That Classifies Which Tweets Are Toxic

  • Published: Sep 5, 2024

Comments • 10

  • @kaanbicakci
    @kaanbicakci 3 months ago +7

    Calling the shuffle() method on a tf.data.Dataset instance before splitting the dataset can cause data leakage. The dataset is reshuffled on every iteration, so each time take() or skip() is called, the order of elements drawn from "dataset" is different, which may introduce overlapping samples. Here's a small example (the output will be different every time, but you should see the overlap after running it a few times):
    import tensorflow as tf
    import numpy as np

    num_rows = 10
    dataset = tf.data.Dataset.from_tensor_slices(np.arange(1, num_rows + 1))
    dataset = dataset.cache()
    dataset = dataset.shuffle(num_rows)
    dataset = dataset.batch(2)
    dataset = dataset.prefetch(1)

    train = dataset.take(2)
    val = dataset.skip(2).take(1)
    test = dataset.skip(3).take(1)

    def extract_ids(ds):
        ids = []
        for batch in ds:
            ids.extend(batch.numpy())
        return np.array(ids)

    train_ids = extract_ids(train)
    val_ids = extract_ids(val)
    test_ids = extract_ids(test)

    train_val_overlap = np.intersect1d(train_ids, val_ids)
    train_test_overlap = np.intersect1d(train_ids, test_ids)
    val_test_overlap = np.intersect1d(val_ids, test_ids)

    print("Train IDs:", train_ids)
    print("Val IDs:", val_ids)
    print("Test IDs:", test_ids)
    print("Train-Val Overlap:", train_val_overlap)
    print("Train-Test Overlap:", train_test_overlap)
    print("Val-Test Overlap:", val_test_overlap)

  • @diegofabiano8489
    @diegofabiano8489 5 months ago +10

    I honestly like the Machine Learning system design interviews much better; the one with the Meta engineer where he actually applied the steps was awesome!

  • @mandanafasounaki2192
    @mandanafasounaki2192 4 months ago +1

    Great work, solid coding skills. The thing I would add is that when we use the BERT tokenizer, all the information that needs to be extracted from the text for classification is already embedded into the vectors. A simple perceptron could work well on top of the embeddings. But your approach is great for demonstrating the development lifecycle of an ML project.
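
    As a rough sketch of that idea (my own example using the Hugging Face transformers package, not code from the video, and using a frozen bert-base-uncased encoder to produce the embeddings rather than only the tokenizer), a single sigmoid unit on top of BERT's pooled output can serve as the classifier:

    import tensorflow as tf
    from transformers import BertTokenizer, TFBertModel

    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
    encoder = TFBertModel.from_pretrained("bert-base-uncased")
    encoder.trainable = False  # keep the pretrained encoder frozen

    max_len = 64
    input_ids = tf.keras.Input(shape=(max_len,), dtype=tf.int32, name="input_ids")
    attention_mask = tf.keras.Input(shape=(max_len,), dtype=tf.int32, name="attention_mask")
    pooled = encoder(input_ids, attention_mask=attention_mask).pooler_output
    # The "simple perceptron": one sigmoid unit for toxic vs. non-toxic
    output = tf.keras.layers.Dense(1, activation="sigmoid")(pooled)
    model = tf.keras.Model([input_ids, attention_mask], output)
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])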

  • @DrAhdol
    @DrAhdol 5 months ago +2

    Something I'd like to see more of in these ML videos is acknowledgement of approaches that don't leverage NNs. For something like this, you could use multinomial naive Bayes with bag-of-words/tf-idf features and get good performance with super fast inference speed, as a baseline to compare against the more complex NN models.
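
    For what it's worth, a minimal sketch of that kind of baseline (my own example with scikit-learn and made-up toy data, not anything from the video):

    from sklearn.pipeline import Pipeline
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.naive_bayes import MultinomialNB

    # Hypothetical toy data: tweets with binary toxic / non-toxic labels
    tweets = ["you are awesome", "I hate you", "great talk", "you are an idiot"]
    labels = [0, 1, 0, 1]

    baseline = Pipeline([
        ("tfidf", TfidfVectorizer(ngram_range=(1, 2), min_df=1)),
        ("nb", MultinomialNB()),
    ])
    baseline.fit(tweets, labels)
    print(baseline.predict(["you are terrible"]))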

  • @jackjill67
    @jackjill67 5 months ago +4

    First useful video... otherwise most people just talk it through

  • @alexb2997
    @alexb2997 3 months ago +1

    Just to speak up for recurrent networks -- it's a little unfair to LSTMs to suggest they might struggle with long-term dependencies in tweets. Transformers do make long-range retrieval architecturally easier, but LSTMs are a variant of RNNs that was specifically designed to handle long-term dependencies. For tweet-length documents, you'd be fine. I'm not saying don't use a transformer, just don't write off recurrent models so quickly.
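
    For reference, a minimal Keras sketch of the kind of recurrent model being defended here (my own example with made-up vocabulary size and embedding width, not the video's code):

    import tensorflow as tf

    VOCAB_SIZE = 30522  # e.g. a WordPiece vocabulary size (assumed)

    # Tweet-length sequences are well within what an LSTM can handle.
    model = tf.keras.Sequential([
        tf.keras.layers.Embedding(VOCAB_SIZE, 128, mask_zero=True),
        tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(64)),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])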

  • @TooManyPBJs
    @TooManyPBJs 4 months ago

    Isn't it a bit duplicative to add an LSTM on top of BERT tokens, since BERT is already sequence-aware?

    • @alexb2997
      @alexb2997 3 months ago

      The tokens are just simple vocab indices; there's no sequence encoding involved at that stage. The sequence magic happens inside the transformer, which wasn't used here.
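
      To illustrate (a small sketch assuming the Hugging Face transformers tokenizer, which may differ from what the video used), the tokenizer output is nothing more than integer vocabulary IDs:

      from transformers import BertTokenizer

      tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
      enc = tokenizer("this tweet is toxic")
      # Just a list of integer vocab indices (starting with 101 for [CLS]);
      # order and context are only modelled later, inside the transformer.
      print(enc["input_ids"])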

  • @user-dx4un7gg2z
    @user-dx4un7gg2z 5 months ago +1

    How did you scrape this data from Twitter? The Twitter API has a lot of restrictions. Can you please explain that?