Text Classification using Transformers | BERT | Custom Dataset | with code

Raj Kapadia

Просмотров 17 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 2 фев 2025

Комментарии • 45

@knifeghandi 8 месяцев назад ⁺¹
Great video and Github repo. My question was "what format do I need to put my data in?" and you answered something that I couldn't find anywhere in the HuggingFace docs- simply, it depends on the model you're fine tuning. So thank you for making a video that is still one of the most clear resources a year later!
@somacode_ Год назад ⁺²
Wow, this is exactly what I was hoping to find to get started with transformers! I noticed that the other tutorials didn't include the folder structure of the code, but yours does. Thank you so much for sharing!
@sanujatharinda6525 2 года назад ⁺²
Been looking for a video to get started with transformers. Thank you very much for this...
@bhartiparmar1078 2 года назад ⁺¹
The information provided in this app is very useful for today's generation, very hard work and attempts done by the maker,kudos for such type informative matters.
@parsotamparmar6843 2 года назад ⁺¹
Very interesting and knowledgeable materials,best efforts.
@Gagglebot Год назад ⁺¹
very helpful video. If anyone else has problems with the torch sigmoid method in the get_prediction function (getting an error saying it requires two positional arguments), just create a static sigmoid method but apply autograd.Variable(method-input-variable) on the input to that method
@saimakhosa6586 2 года назад ⁺¹
V informative ,thanks
@zarahassan6058 7 месяцев назад ⁺¹
Sir in my case I've multiple columns/attributes like id, timing, name, and class etc the thing I've to do is to classify the reviews as fake or real. How can I use BERT in this case?
For the Pre-processing what should i do?
@rajkkapadia 7 месяцев назад ⁺¹
Use only those columns that are useful...
@ruizhou1243 Год назад ⁺¹
"RuntimeError: Placeholder storage has not been allocated on MPS device! "
How about this error in the last?
@rajkkapadia Год назад
Check if you have a GPU device, sometimes it fails to run on CPU...
@LearningWorldChatGPT 2 года назад ⁺¹
Amazing video! ... Thank you very much for sharing your knowledge
@sadkchris9785 2 года назад ⁺¹
In get_prediction function, you used "trainer". When I try to use my model in another ipynb file, I get error, because trainer is not initialized in new ipynb file. What is the solution of this problem? To use my model, do i have to do these processes every time?
@rajkkapadia 2 года назад
If you please watch the video till the end, I have shown this as well...
@sadkchris9785 2 года назад ⁺¹
@@rajkkapadia I have watched, still can't use get_prediction function in other ipynb file.
@sadkchris9785 2 года назад
@@rajkkapadia Can I use the function without trainer row?
@rajkkapadia 2 года назад
@@sadkchris9785 you need to pass the path to the model in the ipynb file when you create a new instance of the model...
@sadkchris9785 2 года назад ⁺¹
@@rajkkapadia I can reach my model from other ipynb file, i did everything in your video. My error is NameError: name 'trainer' is not defined. Because trainer was initialized in main ipynb file.
@EzriBenAmi Год назад ⁺¹
ran into this problem: RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when
checking argument for argument index in method wrapper_CUDA__index_select)
on this line: get_prediction('I am not happy to see you.')
@rajkkapadia Год назад ⁺¹
I will check and update the code if needed...
@arkomazhar4424 2 года назад ⁺¹
Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0!
I am facing this error in the get_prediction
@rajkkapadia 2 года назад
Hi, I am not sure about the error, you can start printing each variable one by one to get the actual error...
@WoWmastersonTuralyon Год назад
I see that your model is just the default model for classification provided by the transformers library, AutoModelForSequenceClassification. Have you tried making a more complex model using Keras, for example: using a transformers layer as input followed by a number of hidden layers (RELU + dropout)? What are the situations in which such a model (more complex) should provide better results than the more basic one (default AutoModelForSequenceClassification)?
@rajkkapadia Год назад
Hi, I have not tried that yet, but we can play around, there is one point though, Transformers are made using Pytorch, while you want to use Tensorflow, I am not sure that will gel up... But we can try this approach using Pytorch...
@WoWmastersonTuralyon Год назад
@@rajkkapadia Thanks for the quick response, a PyTorch approach would be great as well! I am currently trying to solve the following task: classify emails into 6 classes. I want to use the email bodies (after carefully selecting the relevant parts of the body -- ignore links, automated messages, and so on) and the email subjects. How can you build a model that uses multiple inputs? I tried concatenating the strings into a single input, but I don't think this is the right approach, as they would lose their independency.
@rajkkapadia Год назад
@@WoWmastersonTuralyon You can use different input layers for each input and then concatenate them, make sure the dimensions are right...
@Chuukwudi Год назад
Hi @WoWmastersonTuralyon, I just thought of the same. I am trying BERT on a binary classification task. The solution provided here quickly overfits the data in less than 2 epochs. performance on evaluation data quickly becomes shit after 3 epochs. I think best approach would be to freeze the weights of BERT and add a few layers, with a bit of regularisation as needed. Have you found a way of doing this right now ?
@smitm.1342 Год назад ⁺¹
I have a headline text to body text matching and classification task. Not sure how to tokenise both columns, body text contains 4-5 lines. What could be the solution?
@زينبسالمعزيز-د2ح Год назад ⁺¹
Can I used this model to detect cyber attacks ?
@rajkkapadia Год назад
It is a text classification model...
@sayedabdulsamad1047 2 года назад ⁺¹
Hi, not able to install datasets module...
@rajkkapadia 2 года назад ⁺¹
pypi.org/project/datasets/
@sayedabdulsamad1047 2 года назад
@@rajkkapadia Thanks, I was able to install. But while training the model with three labels I faced this problem - ValueError: Target size (torch.Size([8])) must be the same as input size (torch.Size([8, 3]))
@rajkkapadia 2 года назад
@@sayedabdulsamad1047 I am not sure, make sure the size of the target, and feature is as required by the model...
@sayedabdulsamad1047 2 года назад ⁺¹
@@rajkkapadia yeah looking for some info on that only. I tried both one hot encoded and approach and normal one
@rajkkapadia 2 года назад
@@sayedabdulsamad1047 Are you trying multi class classification... Then you should look for that on Hugging Face...
@debojitmandal8670 Год назад ⁺¹
Why are you doing binary classification please do multi class and mukti label
@rajkkapadia Год назад
Hi, if you watch carefully, I have shown a way to do that as well...

Следующие

Автовоспроизведение

Fine-Tuning BERT for Text Classification (w/ Example Code)