Attrs, Pydantic, or Python Data Classes?

Поделиться
HTML-код
  • Опубликовано: 4 янв 2025

Комментарии • 127

  • @ArjanCodes
    @ArjanCodes  Год назад +1

    💡 Get my FREE 7-step guide to help you consistently design great software: arjancodes.com/designguide.

  • @BPopes
    @BPopes Год назад +77

    Answering your question in the video about Pydantic validation (~ 14:53 ), pydantic's default mode is to validate on instantiation only. But you can set validate_assignment=True in the ConfigModel of your model to validate when you assign as well.

    • @drheaddamage
      @drheaddamage Год назад +4

      I don't get the purpose of this. When would you want to validate on instantiation, but not on assignment? Sounds like a good way to complicate your debugging if you assume a variable is a particular type and then turns out to be overwritten with something completely different.

    • @palto-ai
      @palto-ai Год назад +14

      @@drheaddamage I guess the use case for only validating at instantiation is that after that you may trust it based on the code and not validating later may provide performance benefits. The instantiation could be with external data (user input, config file, API data) while the subsequent assignments are within the code itself (e.g. calculations)

    • @ripichipina
      @ripichipina Год назад +7

      Also validation on every assignment is quite expensive operation for pydantic.
      There is a pattern when you create only unchangeable instances. Like using dataclasses ‘frozen’ init parameter.
      And if you want to change instance, you should create a new one.

    • @Nobody313
      @Nobody313 Год назад +2

      @@drheaddamage Validation on assigment could be very expensive. If you use a functional approach, you will never modify data (objects) just create new ones, so makes sense.

    • @piotrgretkierewicz3906
      @piotrgretkierewicz3906 Год назад +1

      @@drheaddamage It is not needed with frozen classes and those are extensively used in some projects. I think there was some argument that this is a bit more secure way of keeping data - creating new objects over editing existing one

  • @romanzaiev
    @romanzaiev Год назад +4

    Thanks for the video! We use both dataclasses and pydantic in our projects for 2 different cases. Pydantic is more than a schema validation library today but we consider it should be used as a schema validation tool only. So, we use it extensively for the request/response schemas and some other things. On the other hand we have data models that we map on the corresponding DB tables and we use dataclasses for that. Basically, the standard flow is request -> pydantic validation -> some logic and dataclass models -> pydantic validation -> response. Sometimes it may be handy to map the schema to the model data automagically and for that case there is an orm_mode flag in pydantic. I just want to say they are not competitors at all.

  • @joaopedrorocha5693
    @joaopedrorocha5693 Год назад +26

    Great video! The idea is great ... comparing tools and them giving your nuggets of wisdom on which one to choose for each job ... It's great to have a glimpse of your experience while choosing tools.
    Suggestion for a next one: comparison between web frameworks (such as fast api, django, flask)

  • @EricChenx
    @EricChenx Год назад

    Thanks!

  • @realplod
    @realplod Год назад

    Thanks

  • @timorieber
    @timorieber Год назад +13

    Hi Arjan, thanks for your effort you put in this videos, I got a lot of inspiration from them in the last years!
    I wanted to add, that Pydantic also has a dataclass decorator which is a drop-in replacement for the standard dataclasses, with all the validation features available as for the Pydantic BaseModel. Perhaps that would have been the more comparable choice.
    Keep up the great work!
    Timo

    • @evandrofilipe1526
      @evandrofilipe1526 Год назад +1

      It has all the features of pydantic without inheritance?!

    • @timorieber
      @timorieber Год назад

      @@evandrofilipe1526 it has all the validation features, but it is not a replacement for the Pydantic BaseModel. I recommend their docs for further details.

    • @palto-ai
      @palto-ai Год назад +4

      One thing to note is that the Pydantic dataclass has some restrictions compared to the Pydantic BaseModel features. Would be nice to compare the Python built-in dataclass, the Pydantic dataclass and the traditional Pydantic BaseModel.

  • @rx97_mc
    @rx97_mc Год назад +2

    You've been releasing banger videos one after another, i swear, I link or get linked your video hours after release!

  • @osjerick
    @osjerick Год назад +1

    Awesome video! I use dataclasses and Pydantic a lot. I'll take a look at attrs now. Adding Mypy to my dev workflow has also helped me deliver better code.

  • @guyonlead
    @guyonlead Год назад +6

    I’m been rocking Pydantic for about a month now and I’m absolutely in love with it. It’s simpler to use than data classes IMO when using REST API. Granted I have a bunch of base models into another class then I use my own methods to manipulate the data but it works out nicely in the end.

    • @dpav02
      @dpav02 Год назад

      Agreed. Pydantic also works extremely well when using an ORM like Sqlalchemy.

  • @horoshuhin
    @horoshuhin Год назад +15

    Arjan it would be awesome if you'd interview the creator of pydantic. It would be a fantastic episode of "Become A Better Software Developer" or something like that. I don't know if you've thought about this kind of format. I'd love to see pros dive into the nitty gritty and share ideas or the way to do stuff. Just a thought.

    • @ArjanCodes
      @ArjanCodes  Год назад +1

      Thanks for the suggestion!

  • @tjanos88
    @tjanos88 Год назад

    Thank you, I learned a lot again. Have a nice weekend!

  • @hexomega9445
    @hexomega9445 Год назад

    Thanks so much to bring this to the table!

  • @dorb1337
    @dorb1337 Год назад +2

    Great video ! a question that pops - Where are you getting your knowledge from ? reading the documentation is nice, and playing around with a module or a library is great - but aren't you afraid to miss out few of the features ? are there any good forums or knowledge resources you're using to stay up to date and understand all the possible features of a library or module you use ?

  • @domenechj
    @domenechj Год назад +3

    Great video as always! By the way, what library do you prefer for performing validation on dataframes?

    • @babc142
      @babc142 Год назад

      I've used pandera pretty heavily in production and it's very capable and the developer is super helpful. But I haven't explored pydantic extensively, and I feel there might be some advantages to using it in exchange for writing quite a lot more code

  • @cppsage
    @cppsage 8 месяцев назад

    And another nit in the attrs example, and this may very well be my misunderstanding of str, but instead of lower should rather we use casefold for comparisons? As I understand that is dedicated for that purpose.

  • @shS83
    @shS83 9 месяцев назад

    Great video. Good explanation for using int in price. This, like your other videos, come in handy in lots of different scenarios. Thank you for your contribution.

    • @ArjanCodes
      @ArjanCodes  9 месяцев назад

      I'm happy to hear you're enjoying the content!

  • @viajeradelaluz
    @viajeradelaluz Год назад

    Great Video! I actually started using dataclasses when I saw your video about it few months ago. I 'd like you to make a short video with pros and cons about your recommendation to use int() instead of float() type for prices fields. You left me thinking about that idea. Many thanks! 😋

  • @simmonslucas
    @simmonslucas Год назад +2

    You already convinced me of dataclasses. They spread across our code base like a slime mold.

  • @pamdemonia
    @pamdemonia Год назад

    Just a late thank you for your dataclasses video which taught me about the __post_init__() method which really came in handy recently.

  • @garrettwill1220
    @garrettwill1220 4 месяца назад

    for pydantic not printing the object type just do print(repr(banana)) and it will put it in a nice form that looks exactly like you would create it in code
    repr calls the __repr__ dunder method which all python classes inherit from default object. In python print calls __str__, which unless overridden actually calls __repr__. If you have ever printed an object to terminal and you get something like: its because it __str__ has not been overwritten and it has inherited __repr__ from object.
    I guess somehow youtube is messing with my underscores and turning my text italics but I think its still understandable.

  • @grzegorzbajson3897
    @grzegorzbajson3897 Год назад +4

    It's worth to mention the performance. Dataclasses are way faster than BaseModel (I learnt this the hard way :-/). There are some improvements expected with Pydantic 2.0, but for now:
    In [1]: from dataclasses import dataclass
    In [2]: @dataclass
    ...: class A:
    ...: a: int = 1
    ...: b: str = "one"
    ...:
    In [3]: %timeit a = A()
    80.5 ns ± 0.212 ns per loop (mean ± std. dev. of 7 runs, 10,000,000 loops each)
    In [5]: from pydantic import BaseModel
    In [6]: class B(BaseModel):
    ...: a: int = 1
    ...: b: str = "one"
    ...:
    In [7]: %timeit b = B()
    1.03 µs ± 3.7 ns per loop (mean ± std. dev. of 7 runs, 1,000,000 loops each)

  • @MedievalChips
    @MedievalChips Год назад

    Thank you for the great video. Love the deep dive in new packages.

  • @Patrick-hl1wp
    @Patrick-hl1wp Год назад

    Super helpful information, thanks Arjan

  • @mariamozgunova9584
    @mariamozgunova9584 Год назад

    In Python, one can also use `Decimal` to represent monetary amounts

  • @helish_88
    @helish_88 Год назад

    My english is not so good, usually I don't understand all but ur speech is so good, I understood almost everything, btw its very helpful video

  • @cppsage
    @cppsage 8 месяцев назад

    Can’t read all comments, but I think I saw a bug in the dataclass example. I think if your code would ever run through a midnight you’d need that default for the order date to also be dynamic with default_factory, otherwise it’ll be the date of the creation of the *class* and not the instance. So if I started my program December 2023, all my orders will come up with that date by default.

  • @GeorgelPreput
    @GeorgelPreput Год назад +1

    Would be great to have a follow-up which compares validation in pydantic, param and traitlets

  • @SuperDoc3000
    @SuperDoc3000 Год назад

    Thanks @ArjanCodes, quality content every time!

  • @danimperiale8545
    @danimperiale8545 Год назад

    Dude, great video. Just really really great.

  •  Год назад

    As always, awesome video!
    Thank you for the great content. I'm improving a lot my developer skills with your lessons.

  • @KoyaMel
    @KoyaMel Год назад +5

    literally 5 minutes ago, I was in the situation to decide which to use. (you read my mind)

    • @ArjanCodes
      @ArjanCodes  Год назад +1

      Haha, so I was just in time :).

    • @daverichardhadley
      @daverichardhadley Год назад +2

      What did you choose?

    • @TheSzybas
      @TheSzybas Год назад +3

      @@ArjanCodes because the wizard is not late nor early. Just in time.
      Like a compiler...

    • @KoyaMel
      @KoyaMel Год назад +1

      dataclass for the win

  • @matercomus
    @matercomus Год назад

    Great video!

  • @potens1
    @potens1 Год назад

    About the representation in pydantic, instead of printing the banana, you can do print(repr(banana)), you will get Banana(name='banana', category='fruit'....).

  • @GigaMarou
    @GigaMarou Год назад +1

    Hey Arjan, great video. How would you use dataclasses, attrs or pydantic to validate Tabular Machine Learning data. Greetings from Germany :)

  • @hargovind2776
    @hargovind2776 Год назад

    I hit like even before starting your video. Already know the content's gonna be amazing, keep doing the awesome work!!!!

  • @mithunmanoharmithun
    @mithunmanoharmithun Год назад

    Awesome content as always! Could you please make video on how to persist these objects into a datbase and best practices to do so ?

    • @ArjanCodes
      @ArjanCodes  Год назад

      Thanks! Your suggestion is noted :)

  • @TomTrval
    @TomTrval Год назад

    Great overview thank you :) Personal preference: dataclass with `slots=True`, I hope we get build in object pooling like `pool=300` one day for for dataclasses

  • @CarlosMorenoV
    @CarlosMorenoV Год назад

    Nice video, I realised about attrs because of Fluent Python 2nd edition by Luciano Ramalho. There is an entire section in that book about Data classes. If you ask me, I prefer to use attrs or Pydantic. You can use dataclass to prototype an initial version of some tiny app. However, once you need to build something more professional, you definitely need to go beyond. And Attrs or Pydantic is the correct choice.

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Год назад +1

    7:38 As soon as you said it uses subclassing, I immediately thought “there must be a metaclass involved”.
    Had a look at the source code, and yes, that is how it works.

  • @yishan9598
    @yishan9598 3 месяца назад

    I have a question about dataclass VS normal class. When I use dataclass define one kind of data, at the beginning it is perfect. However, then more and more methods were added based on the attributes of the dataclass and the class is becoming more and more heavy and some of the methods doing something very complicated. I am not sure whether this is a good practice and whether I should convert the dataclass to a normal class. Could you please give me some suggestion about the use case of normal and data classes?

  • @NopeNotMeNoSiree
    @NopeNotMeNoSiree Год назад +1

    I had chosen pydantic for a few projects, but now I'm ripping it all out because of the incompatible API changes brought by 2.0. It appears the authors of pydantic are _very_ opinionated about how method names should be formatted, so they cavalierly replaced parse_raw by model_validate_json. Not only is the function name much longer, but it apparently does exactly the same thing. If I keep pydantic in the project, then it's like a ticking time bomb for future developers.

  • @MrVernuk
    @MrVernuk 9 месяцев назад

    Thank you fot this video! It's really helpfull ☺

    • @ArjanCodes
      @ArjanCodes  9 месяцев назад

      I'm glad to hear the video was helpful!

  • @whkoh7619
    @whkoh7619 Год назад

    thanks for the compare! the real content is here as always ;)

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Год назад

    4:50 Sounds like defining key fields in a database record.

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Год назад +1

    4:07 Why not use the Decimal type?

  • @kosmonautofficial296
    @kosmonautofficial296 Год назад

    When you printed that banana it totally made sense to me

  • @PavithraSuLa
    @PavithraSuLa Год назад

    Awesome video! Thank you for the great content.
    Could you please make video about python metaprogramming and metaclass with real world example.

    • @ArjanCodes
      @ArjanCodes  Год назад

      Thanks for the suggestion it's noted!

  • @cppsage
    @cppsage 8 месяцев назад

    Hehe, this is REALLY a NIT. I was told that zero is neither negative or positive, that function in the attr example is non-negative, not positive. (Of course with IEEE-754 we can get zero BOTH negative or positive, but that is only useful for a zero that isn’t really zero, just too small)

  • @giovanniosorio4759
    @giovanniosorio4759 Год назад

    I haven’t been able to understand Enum classes and what exactly are they useful? I've been looking for videos that explain it well but there aren't any. Could you please make one?

  • @JohnFKenobi
    @JohnFKenobi Год назад

    great stuff

  • @kennethgee2004
    @kennethgee2004 4 месяца назад

    strange that attrs is the base class for dataclass and the same functions for validation are not inherited. is attrs using a different package to add in validation? What about just using a validation class in the first place as this is a time when inheritance is a good thing?

  • @tanjt107
    @tanjt107 Год назад

    I am not sure if why question is clear and related to this video. Sometimes in my classes there are a lot of lines in the init/post_init method. The reason of this is I don’t want the same thing being calculated many times and saving the result in memory can speed up the process. Maybe I should just use function instead?

    • @ArjanCodes
      @ArjanCodes  Год назад +1

      If it's a complicated computation, I would suggest to move it out of the init method and into a separate function. You could also look into functools cached_property decorator. This computes a value once and then memoizes it.

    • @tanjt107
      @tanjt107 Год назад

      @@ArjanCodes Thanks! I never know this decorator. It helps!

  • @evandrofilipe1526
    @evandrofilipe1526 Год назад

    I would like to know what you think about typeguard module since I never seen you use it.

  • @LeftLib
    @LeftLib Год назад

    as a newcomer I was hoping to find The Answer, what I should be using beginning from now. After watching this I am not sure, it seems odd to have a choice of 3. What would it be like 5 years from now?

  • @SimeonNedkov
    @SimeonNedkov Год назад +1

    "Don't be an attr (etter)..." is a great way to start the week! 😁

  • @CrazyDalekCaan
    @CrazyDalekCaan Год назад

    You missed something important: pydantic has its own dataclass decorator which can be used much like a BaseModel, but is fully API compatible with a built-in dataclass

  • @WilliS1651686868
    @WilliS1651686868 Год назад

    I really like the "frozen" option of dataclass that allow to have some sort of immutability. Does attrs or pedantic have something similar?

    • @johantoday
      @johantoday Год назад

      Pydantic supports frozen fields or frozen instances

  • @OzFush
    @OzFush Год назад

    What brand is your hoodie? I want one but can’t find the logo in google image search 😂

  • @Vijay-Yarramsetty
    @Vijay-Yarramsetty Год назад

    I've been having this doubt since a long time. which one among them is better?

  • @WilliS1651686868
    @WilliS1651686868 Год назад

    Are there any difference between those three options in terms of performance?

  • @ArtyomShakurov
    @ArtyomShakurov Год назад

    Thanks! Can you please compare pydantic and marshmallow?

  • @AngDmz
    @AngDmz Год назад

    With pydantic you have super powerful objects factory to perform really good OOP without much effort

  • @jkrigelman
    @jkrigelman Год назад

    Now I need to learn what atter is in Dutch.

  • @PraecorLoth970
    @PraecorLoth970 Год назад +1

    Nice video, cheers from a country where taxes aren't limited to 100% 😭

  • @Vodkarh
    @Vodkarh Год назад +26

    ok now I want to know what attrs means in dutch

    • @RutNij
      @RutNij Год назад +4

      The dutch word is 'etter', which is pronounced similar to attr in english. Etter translates more or less to 'jerk' in english 😂

    • @HanWechgelaer
      @HanWechgelaer Год назад

      Actually attr sounds like the Dutch word "etter" which means a brat who is acting up.🙂

    • @samsung40_media87
      @samsung40_media87 Год назад

      in afrikaans, it is pretty bad

    • @evandrofilipe1526
      @evandrofilipe1526 Год назад

      ​@@samsung40_media87 what does it mean?

    • @JoergAsmussen
      @JoergAsmussen Год назад

      he is een etter -> he's a jerk (according to Google translate)

  • @esmaelmohamed8209
    @esmaelmohamed8209 Год назад

    Arjan how I communicate you officially for your courses?

    • @ArjanCodes
      @ArjanCodes  Год назад

      To talk about my courses you can send an email to: support@arjancodes.com

    • @esmaelmohamed8209
      @esmaelmohamed8209 Год назад

      @@ArjanCodes Okay Arjan I will. ❤

    • @esmaelmohamed8209
      @esmaelmohamed8209 Год назад

      @@ArjanCodes Now I did that

  • @ErikS-
    @ErikS- Год назад

    I honestly used dataclasses for some time, but I stopped using them.
    I prefer having all classes written according to the standard rules for classes. So no instance variables in the part where you would standard see the class variables defined.
    Instead of mixing the dataclasses into regular classes, It would have been better if a new type of datastructure would be implemented in Python, and which is available by DEFAULT. Making the distinction with a decorator just doesn't do it for me... I like to say that pydantic is by the way a 'cleaner' solution imo...

  • @ac130kz
    @ac130kz Год назад

    attrs is excellent, but it's kinda not compatible with everything else, so it's an all in approach

  • @proofit404
    @proofit404 Год назад

    Лучший!

  • @nixonkutz3018
    @nixonkutz3018 Год назад

    I always get tickled by the volume and popularity of code that attempts to bolt type checking onto a duck-typed language

  • @Blingblingblingling
    @Blingblingblingling Год назад

    Curious. Why not, like price being int, make weight also int and represent the smallest unit we care about (like gram)?

  • @_Amilio_
    @_Amilio_ Год назад +1

    I forgot that "attrs" in dutch was niet zo leuk 😂

  • @drheaddamage
    @drheaddamage Год назад +2

    I like attrs a lot, but it is a bit funny to me that we say: "Python is great because it is dynamically typed", and then the first thing we do is strictly typing using one of these packages...

  • @julienyt1600
    @julienyt1600 Год назад

    Pydantic's Soo good to ensure that your python objects can easily become dicts, serialize in JSON, parse_raw query results and validate at the same time.
    I use pydantic all the time so other developpers only need to learn them and not have to deal with the differences coming in dataclasses fields

  • @dalaymann
    @dalaymann Год назад

    한국인 중에 Pydantic 을 가르쳐 줄 수 있는 사람은 대개 미국에 살고 있으니... 감사합니다.

  • @HanWechgelaer
    @HanWechgelaer Год назад

    Nice video, now don't be a attr and like the video 😂😂😂