the cleanest feature in C that you've probably never heard of

Поделиться
HTML-код
  • Опубликовано: 26 янв 2025

Комментарии • 557

  • @CoolestPossibleName
    @CoolestPossibleName Год назад +1424

    I discovered union when I tried to name a function union

    • @mingy7949
      @mingy7949 Год назад +46

      I discovered it from dwm

    • @stapler942
      @stapler942 Год назад +57

      I think I learned about unions from some book when I decided to learn C.
      Two keywords that are probably even more obscure are "register" and "volatile".

    • @AustinClemLive
      @AustinClemLive Год назад +29

      This is how I discovered the "register" type modifier

    • @filipstojanovicmechanicale9265
      @filipstojanovicmechanicale9265 Год назад +13

      I discovered register keyword when i tried to name some sensor register as "register"

    • @nallid7357
      @nallid7357 Год назад +14

      @HyperWin Just to add in for the conversation, that keyword tells the compiler not to change any of the code that is written, as that code may be used asynchronously, is changed by the hardware itself, or the software, such as a signal handler, changes the value itself. I used to use it for threads, because some of the code needed to work as intended and not moved or modified by the compiler for optimization purposes that will create undefined behavior.

  • @stapler942
    @stapler942 Год назад +297

    They say big corporations employ people to go into their repos and replace all C unions in commits with other corporate-approved solutions. They're known as "union busters".

    • @DavidLindes
      @DavidLindes 10 месяцев назад +10

      Hah. If only union busting was as relatively benign as that... alas.

    • @connor3010
      @connor3010 4 месяца назад

      oh my

    • @rafagd
      @rafagd 3 месяца назад +2

      Take my like and go home.

  • @eduardobarreto6116
    @eduardobarreto6116 Год назад +458

    unions are also very useful for dealing with communication packets, where you have a byte array that represents your entire packet, along with each item present within it. This way you can access the entire package (to send to other parts of your program), as well as each separate item

    • @eduardobarreto6116
      @eduardobarreto6116 Год назад +5

      @@TheCarmacon I don't know if I understand your last point about consistency... consistency in communication can be achieved in other ways such as validating the package in N ways before reaching the union... Did you mean anything after that? Like any conversion problem? I would love to know more about it

    • @0xDEAD_Inside
      @0xDEAD_Inside Год назад +1

      I need an example of it. Can someone show me some demo code to understand how it works?

    • @BacklTrack
      @BacklTrack Год назад +3

      that's the only time I've used em

    • @GoofyChristoffer
      @GoofyChristoffer Год назад +2

      This is exactly what I've used unions for in communications between two embedded processors

    • @neniugrava
      @neniugrava Год назад

      ​@@0xDEAD_InsideLook at 3:57 in this video. In his code he's using the union to allow the hardware register to be accessed as different data types, but you use the same strategy when decoding communication protocols. You usually have an enum to represent the type of the payload, and you know which of the union'd structs to access based on the message type.
      It's not a very good way to do this, though, because compiler and/or endianness differences between processors can lead to the same C code parsing the data differently.
      It's convenient to access data this way, but you have to rely on compiler-specific features to make it reliable. And like with bitfields the compiler may be creating a lot of code behind the scenes, so you aren't really saving anything over just writing your own robust parsing code.

  • @hymnsfordisco
    @hymnsfordisco 11 месяцев назад +16

    The data conversion examples are cool, but it's important to note that they depend on the byte order ("endianness") and memory size of int, which can both change depending on what platform you're on.

    • @cainabel2553
      @cainabel2553 3 месяца назад

      If you end up depending on byte order, without using a memcpy family function, it probably means you have violated the type aliasing rules and compiler can miscompile your code.

  • @mk72v2oq
    @mk72v2oq Год назад +216

    The last thing is actually a very old concept called discriminated unions. Primarily used in functional languages, it was buried for years during OOP languages golden age, where it is effectively was substituted by inheritance.
    More modern languages are bringing it back though, in light of recent tendencies where inheritance is considered a bad thing. Like Rust has native support for discriminated unions via its enums, TypeScript has them too etc.

    • @naomicoffman1315
      @naomicoffman1315 Год назад +8

      Recent versions of Java support them as well, along with growing support for pattern matching.

    • @jacobzimmerman3492
      @jacobzimmerman3492 Год назад +10

      The only issue with C unions is you don't get the same compile time checks, which are a big part of the safety/refactorability of algebraic data types

    • @alexmiller3260
      @alexmiller3260 Год назад +2

      I think there is no connection to a bad OOP, obviously because even non-OOP languages mostly support basic features like dinamic dispatch through interfaces and type erasure or even allow you to make manual inheritance with pointers.
      d. unions are just way better sometimes. in Rust ranges Option drops hasnext() function to get next element of a collection

    • @lucass8119
      @lucass8119 Год назад +4

      Sum types like you describe allow polymorphism, just like inheritance and interfaces. The problem, or maybe benefit depending on the circumstance, is that it is closed-set polymorphism. All possible concrete types must be known and specified at compile-time. You can't simply create a new type and then conform to an interface - you must go back and modify the original sum type. This can be cumbersome. Consider Rust, modifying an enum ALSO requires modifying ALL match expressions associated with it. The code change is much larger, and you *might* have to modify stuff you don't want to or aren't allowed to.

    • @ahG7na4
      @ahG7na4 Год назад +1

      pascal calls them variant records

  • @locutusofborg
    @locutusofborg Год назад +228

    I have been programming for over 20 years and yet while I knew about unions I never used them. Just goes to show that an old dog can learn new tricks. I love how you really explain the low level side of things. Keep up the great work!

    • @ThePC007
      @ThePC007 Год назад +2

      Did you just never need one or did you just cast between different structs, instead?

    • @locutusofborg
      @locutusofborg Год назад +6

      @ThePC007 I never really understood them and equated them with a struct so just used structs or more commonly classes as most of my programming is in C++ (Qt/C++ to be exact)

    • @VersDarkmoor
      @VersDarkmoor Год назад +8

      When you are doing any kind of protocol, you basically want to use unions. You basially switch case on the opcode ie. first byte and then interpret the incoming frame accordingly. If you don't want to use unions for some reason, you can always do explicit type casting, which is not great.

    • @ayaya-ayaya
      @ayaya-ayaya Год назад +4

      @@VersDarkmoor In my experience we just use explicit write32_be/read32_le and friends to read out protocol fields. Casting unions/structs onto serialized data is a recipe for disaster once you build for a different microcontroller CPU.

    • @macchiato_1881
      @macchiato_1881 Год назад

      @@VersDarkmoorthat sounds really dumb and unsafe

  • @TheKoga25
    @TheKoga25 Год назад +119

    I used it while making my gameboy emulator, it helped a lot for mapping the CPU registers easily and in a lean way. The data inside some of those registers can be 8 bits (high or low part of a 16bit data, the A, B, C, D, E, H, L registers) or 16 bits (when combining two 8bit registers, the AF, BC, DE, HL), it's a neat feature.

    • @Hauketal
      @Hauketal Год назад +5

      But not portable due to endianness and alignment issues unfortunately.

    • @giorgionegro5750
      @giorgionegro5750 Год назад +5

      @@Hauketal you probably can use some preprocessing to make it at least compilable for all endianes

    • @Hauketal
      @Hauketal Год назад

      @@giorgionegro5750 Oh, it is always compileable. But the result will be buggy, if you e.g. push BC onto the stack and then pop B and C separately. There is no standard header known to me to generate different code, except the one for the Linux kernel. But one can check at startup if the assumptions were correct. Like
      union { long a; char b[4]} u;
      u.a = 0x12345678;
      switch (u.b[2]) {
      case 0x34: // little endian;
      break;
      case 0x56: // big endian
      break;
      default: // weird, PDP-11 had 0x12 here
      }

    • @williamdrum9899
      @williamdrum9899 Год назад

      union reg_af{
      unsigned char a;
      unsigned char f;
      unsigned short af;
      }
      Should be fine

  • @slartibartfasttynsol420
    @slartibartfasttynsol420 Год назад +50

    Converting between types (such as the IP example) is called 'type punning'. Historically it has been poorly defined in the C standards, but does work in gcc C and C++ via a documented extension.
    The draft C18 standard does clarify the situation, and explicitly allows it.
    Note: You need to be careful with endianness - the uint32 representation of the IP address would be different based on endianness, so there are portability considerations.

    • @АнтонЕлькин-т8ъ
      @АнтонЕлькин-т8ъ Год назад +15

      It is worth noting that this is explicitly allowed only in C. It is an undefined behaviour in C++.

    • @lucass8119
      @lucass8119 Год назад

      @@АнтонЕлькин-т8ъ Only technically. Its well-defined in all 3 major compilers via extensions.

    • @EarlHutchingson
      @EarlHutchingson 8 месяцев назад +2

      Yeah there's too much undefinedness circling around unions. This video is fairly bad teachings.

    • @ronald3836
      @ronald3836 4 месяца назад

      I do wonder if it is allowed in this particular case, where strcpy() is used to write characters to some of the elements of Onion.str. If I understand right, C does not allow you to read a 4-byte union member after you write to a smaller union member such as a char, and this seems very similar to that situation.

    • @ronald3836
      @ronald3836 4 месяца назад

      ​@@EarlHutchingsonhe also passes a value of the wrong type as the first argument of strcpy().

  • @protonmaster76
    @protonmaster76 Год назад +65

    I use unions in embedded programming as you described. An other trick for mapping a register to a variable is bit fields where you can make members of the union take a defined number of bits.

    • @m1geo
      @m1geo Год назад +2

      This - so useful!

    • @electronlabs2802
      @electronlabs2802 Год назад +6

      Almost all embedded c developers know it :) ..

    • @EMLtheViewer
      @EMLtheViewer 7 месяцев назад +1

      Could you provide an example use case for this please?

    • @protonmaster76
      @protonmaster76 7 месяцев назад

      @EMLtheViewer, on a microprocessor, you'll often need to configure peripherals like a UART or timers. To do so you often need to set a number of bits in a register correctly, there are a number of ways of doing this, non of them are wrong but some are more difficult to read or maintain than others.
      For example,
      UARTConfig = 0x74;
      Although this will work, anyone reading or maintaining this code would need to refer to the processor data sheet to work out what this does.
      union
      {
      uint8_t Enable:1;
      uint8_t Parity:2; // None, even, odd
      uint8_t Prescale:5;
      } UARTconfig;
      UARTconfig.Enable = 1;
      UARTconfig.Parity = Even; // assuming you've configured an enum or some defines to do this.
      UARTconfig.Prescale = CalcUARTPrrscale(19200);
      Then you can set the register to the value of this union. It is easier to read and maintain.
      Bear in mind that it is up to the compiler author to decide what order the bit field is populated, therefore the code is not portable.

    • @martinrodriguez1329
      @martinrodriguez1329 4 месяца назад +3

      It's the first time I see "another" written like that, and it makes sense

  • @KillerSpud
    @KillerSpud Год назад +52

    I use unions to pack 8 byte CAN messages all the time. Its also very useful when using bitfields as well, instead of checking eight or sixteen individual bit flags, you just check to see if the entire thing is or is not zero.

    • @xhivo97
      @xhivo97 Год назад +1

      i hear this all the time, but I don't understand because I thought a lot of this is UB or something like that

    • @feeditehh
      @feeditehh Год назад +3

      @@xhivo97 its probably technically undefined by some strange wording in the standard, but in practice it works fine everywhere.

    • @KillerSpud
      @KillerSpud Год назад +6

      @@xhivo97 yeah your code might not be portable, but in all my applications so far it didn't need to be. It could break if one system had a different endianess.

    • @shauni_jade
      @shauni_jade Год назад +1

      Exact same use case for me, very useful to separate controller ID from data and checskum and so on

    • @aidanbecker9758
      @aidanbecker9758 Год назад

      ​@@xhivo97 The bit-ordering is what's undefined here. AFAIK you can't even rely on tests for endiannes, so you need to test bitfields on a specific compiler/processor if you want to support it. As @feeditehh said in practice it works fine most everywhere, but that next microprocessor that comes out might just find it more efficient to do things in the opposite order.

  • @m4rt_
    @m4rt_ Год назад +14

    I use unions all the time.
    They are really useful.
    I use it when writing a Lexer/Parser. (Though I use it other times too, it's just where I use it the most)
    e.g.
    typedef struct {
    uint32_t line;
    uint32_t col;
    enum {IDENT, NUMBER, STRING} kind;
    union {
    struct {
    char* data;
    uint32_t len;
    } string;
    uint64_t number;
    }
    } Token;

  • @greyfade
    @greyfade Год назад +6

    @ 2:00 - this isn't valid in C++, and is in fact undefined behavior due to the object lifetime rules: Only one member of a union may be active at any time. What you're demonstrating here is type punning, and is considered a 'wrong" use of unions. Instead, use memcpy() (which the compiler will helpfully reduce to an implicit union).

    • @DWal32
      @DWal32 3 месяца назад

      or simply use unions like that because it's funny

    • @JimLecka
      @JimLecka Месяц назад

      with at least one version of Microsoft C++, the compiler treated a union as a base class. Not what I had in mind.

  • @edengriffin3874
    @edengriffin3874 2 месяца назад +1

    Recently discovered these by chance. Absolute godsend for doing bit manipulation on floats

  • @MikePerreman
    @MikePerreman 3 месяца назад +1

    I think the first time I discovered unions was when I was trying to send a floating point value over I2C. I spent way too long in the 'lets just convert it into a string and send those bytes, and then reinterpret that as a float on the host controller'
    Unions just let you solve that whole mess by only transmitting the 4 bytes or however many make up that datatype

  • @LogicEu
    @LogicEu Год назад +11

    Unions are great! It's extremely useful to be able to interpret a single piece of data as different types, structures or even arrays. Say you have a 32 bit pixel representing RGBA channels, sometimes you may want to access individual channels with their own unique names as you would in a struct, sometimes as raw byte arrays and maybe sometimes you want to simply assign a 32 bit value to the whole pixel.

  • @NeunEinser
    @NeunEinser 3 месяца назад +1

    I would have thought C uses them a lot more, just because Rust is so obsessed with enums (which is essentially a typesafe version of polymorphistic C unions, where the Compiler enforces you are not accessing the wrong variant, and the tag byte is not directly exposed).

  • @shanehebert396
    @shanehebert396 8 месяцев назад +1

    Learned about unions when I was teaching myself C back in the early 80s... have used them a lot over the years... very common in systems and embedded programming, data conversion, and the like.

  • @masondaub9201
    @masondaub9201 Год назад +17

    I've used unions for some embedded stuff but most of the time I just end up using a bunch of bitwise operations instead. Never thought about using them for polymorphism though, that is actually pretty cool!

    • @CamaradaArdi
      @CamaradaArdi Год назад +2

      Cool - and dangerous. The compiler doesn't give any guard rails, wouldn't recommend trying at home

    • @dliamk
      @dliamk Год назад +3

      @@CamaradaArdi Got it, I'll try on the prod environment instead 🗿

  • @DavidJohnsson
    @DavidJohnsson Год назад +34

    I do embedded programming and use unions quite a bit. I think they are very useful, but they have their problems. For instance, if your code is targeting two different platforms with different endianness, then multi-byte unions will give you a very bad day!

    • @CZghost
      @CZghost Год назад +6

      This can actually be turned into an advantage, because you can essentially detect endianess this way. **wink wink**

    • @overbored1337
      @overbored1337 3 месяца назад

      Endianness is not a union specific issue. It affects all data structures whenever multi-byte types are involved

  • @chrisalexthomas
    @chrisalexthomas 3 месяца назад +1

    unions are pretty dammed awesome, especially for doing what you did with the register or the ip address, you got a wild array of bytes and then use a union to say the structure of those bytes

  • @rafagd
    @rafagd 3 месяца назад +1

    Unions are the best footgun I have ever played around with.

  • @Otakutaru
    @Otakutaru 3 месяца назад +1

    Unions are a nice thing to have in a barebones language like C, where you are allowed to reassign the type of a variable by raw pointing, a union is just a fast and concise way to do basically the same thing.

  • @herberttlbd
    @herberttlbd Год назад +4

    Prior to database management systems, data was written in records composed of fixed-length fields. Unions were used to reinterpret the layout of those records, where the type was indicated at the start of the record to simulate a tagged union similar to your last example, and to provide ways to access elements within a field, e.g. 8 chars for the whole date unioned with 2/2/4 chars for month/day/year. Storing data like this is why the strn* functions exist in the standard library; they weren't intended to be "safe" versions of the non 'n' variants as people started suggesting in the 90s. I don't know if it is useful to learn how computing was done prior to the 80s as a lot of it isn't relevant today unless you find yourself interacting with COBOL but it is helpful if you want to know where some of this stuff comes from.

    • @aaronfleisher4694
      @aaronfleisher4694 5 месяцев назад +1

      That’s very handy to know. Thank you.

    • @JimLecka
      @JimLecka Месяц назад

      actually, variable length fields and records were used as far back as 1962. An old IBM mainframe term is "count data".
      unions are really just a c version of the EQU or EQUATE assembler directive

  • @hhhsp951
    @hhhsp951 Год назад +2

    Rival Programming Language: "time to go Union Busting!"

  • @leonid998
    @leonid998 10 месяцев назад +3

    Please make one about type punning and Undefined behavior (in C and C++) :))

  • @oglothenerd
    @oglothenerd 6 месяцев назад +2

    I didn't know C had unions! That is so cool!

  • @hhhsp951
    @hhhsp951 Год назад +6

    they're unionizing

  • @homeopathicfossil-fuels4789
    @homeopathicfossil-fuels4789 Год назад +3

    I learned unions quite early in my C language learning process, I find them extremely useful for all the cases you stated here and more. For making VM's for in development hardware and domain specific languages they are a godsend, also comes in handy in game engine programming, hell my yet to be uploaded codebase for a dead simple and easy to use and maintain forth with readable code and a focus on being embeddable in applications as a lua replacement uses that. Speaking of which, I got a video suggestion: Forth! It has seen its fair share of use in embedded systems.

  • @joopie46614
    @joopie46614 Год назад +1

    Flexibility of unions in C++ however is severely reduced because of stricter safety checks so you have to usually work with some ugly reinterpret cast syntax to make it work

  • @brianm.johnson4438
    @brianm.johnson4438 Год назад +29

    Hey LLL, can you do a video on how SIMD works under the hood? Because it's relatively new, not many assembly textbooks cover it and how to write programs to take advantage of it.

    • @xr.spedtech
      @xr.spedtech Год назад

      See anger fog's blog for performance programming.

    • @adama7752
      @adama7752 Год назад

      Under the hood, a CPU has more than just 1 'adder', it has several ALUs. SIMD is aligning those ALUs to do the same thing (add) at once (or double pump). The cost is generally power, and some developer setup to align the data to the boundary (ignoring unaligned simd with intel).
      In short in assembly the easiest is to do a memcpy. While(addr&0x3) copy_byte(); while(addr&(16-1))copy_uint32(); then copy via simd

    • @sinom
      @sinom Год назад +4

      SIMD was first implemented in a computer in 1966. I wouldn't exactly call it "new". And even the modern variant with SSE was introduced in 1999.
      Basically all they are is that instead of only using two registers for e.g. an add instruction, you instead first load all the data you want into special arrays of registers which the operations then get applied on. After that you can move out your result

    • @modolief
      @modolief Год назад

      Yes please; also, how to do it on Apple silicon. I tried some SIMD code on ARM 64 and couldn't get it to work.

    • @NostraDavid2
      @NostraDavid2 Год назад

      ​​@@sinomit's crazy how some concept are typically seen as new, even though they're old as shit: SQL, SIMD, FP (Lisp is from the 50s), etc
      Edit: I have been guilty of this too, btw

  • @konstantinsotov6251
    @konstantinsotov6251 Год назад +11

    I discovered unions through cppreference
    And when I was writing my own json parser as an exercise, unions were the core of library's design
    They are not used really often, but sometimes they are irreplaceable :)
    Edit: and they are what makes C more functional of a language than python and many other popular ones. Because unions and structs are basically algebraic data types

  • @marcopollom
    @marcopollom Год назад +2

    Dude, I'm taking a MicroP class right now and Unions are EVERYWHERE. Pretty much all configuration registers for devices are stored in some sort of union.

  • @furinick
    @furinick 7 месяцев назад

    OK THAT COUPLING WITH A TYPE INDICATOR IN A STRUCUTRE THING WAS SICK AND IS EXACTLY WHAT I NEEDED IN MY PROJECT THANK YOU

  • @johningram420
    @johningram420 Год назад +3

    I've used unions to make my C++ code more readable. I had a Vector3 class that I would use to represent rgb color values, and xyz coordinates. Instead of having two Vector3 types, or storing values twice, I used unions for each of the floats.

    • @moisascholar
      @moisascholar Год назад

      This right here. Great in graphics/game programming when trying to cannibalize memory (especially to have structs/data fit on a single cache line (64-bytes, generally)).

    • @dliamk
      @dliamk Год назад

      Do you happen to have a git repo I could check out?

  • @lorensims4846
    @lorensims4846 Год назад +10

    Every book on C programming I've ever used had had a section teaching about "structures and unions."
    Now, I've rarely ever used unions, but I appreciate how it can give your program an alternate view of your data.
    I can think of several interesting ways to use this feature, but very few practical ones.

  • @aah134-K
    @aah134-K Год назад +2

    I love it because it can make complicated things more easier. I used it as a messaging protocol, where the shared object is a header, with a type, then the buffer after that depends on the header along with crc

  • @Colaholiker
    @Colaholiker Год назад +2

    Being an embedded developer, unions are a daily thing for me.
    However, there is one little thing I'd like to add - when you discuss the size of your json_t and say that the enum takes up only one byte... while that can be accurate, it isn't necessarily so. It depends on the compiler and its settings - I have worked with compilers where an enum always takes up 32 bits, as this is the native word size of the target architecture. In other cases, the minimum number of bytes needed to represent all values in the enum is used.

  • @1rssr183
    @1rssr183 9 месяцев назад

    Short and sweet, really captured the essence of unions, I have never understood it better until now. Thanks!

  • @cozypreneur
    @cozypreneur 11 дней назад

    I was so happy to come across unions when writing a language parser, super useful for the Token struc

  • @m1geo
    @m1geo Год назад

    In embedded, unions are super useful for setting individual bits and bitfields within a word.

  • @dtikvxcdgjbv7975
    @dtikvxcdgjbv7975 11 месяцев назад +1

    You just proved that they are very useful functionality, not just some hidden peculiar oddity that exists solely because the programmer had spare time to fool around.

  • @LordHog
    @LordHog Год назад +1

    Unions are used in quite a few places. One place it is very useful is in messaging. There is a common messaging API which is used by all cores and task. The messaging size is fixed and is a messaging payload. Each of the task will have a different representation of that data payload. So the overall structure/union is what is accepted by the messaging API, but up to the task what that data represents

  • @re.liable
    @re.liable Год назад +1

    I remember using this in Arduino. I wanted to give individual names to my digital output pins, but also iterate through all the pins in a single loop.

  • @RANDASH
    @RANDASH Год назад +1

    Personally, I find unions most useful when dealing with CAN Network packets, since they can be so easily represented using unions (as they use stuffed bitfields a lot of the time)

  • @gaeel330
    @gaeel330 8 месяцев назад

    I've often used unions like in your last example. Usually I'd ensure correctness by using functions or macros to set both the discriminant and the value, just like your printJSON function ensures that you're correctly interpreting the data.
    I'd never seen unions used to provide multiple ways to read/write the same underlying data though, like with your IP address and hardware register examples, that's really neat! It makes unions an effective way to avoid the "primitive obsession" antipattern, beyond a simple "typedef int ipv4_addr".

  • @iankeck3419
    @iankeck3419 Год назад

    In the example at 3:46, one can define as many fields as the register contains by using bit fields in the structure. While more readable than bit-wise operations, you need to measure to see which is more performant.

  • @Seltyk
    @Seltyk Год назад

    The first time I saw unions being used realistically was in some code for an LED light controller. The union had a struct with one byte per color, a 4-byte RGBI value, and a 4-byte array where each position was one part of RGBI.

  • @gmodrules123456789
    @gmodrules123456789 Год назад +1

    You should do a video on bitfield structs, with variable width fields. Section 6.9, page 149 in K&R.

  • @YandiBanyu
    @YandiBanyu Год назад +1

    I have used unions to serialize floating point number as their byte representation. Just write the float to the float part of the union and then read its byte array part when I need to serialize it. Just need to keep in mind the endianess (but that's mostly non-issue for me since the communicating system is always the same MCU)

  • @dutchcanuck7550
    @dutchcanuck7550 Год назад

    Other computing languages have used union-type structures and syntax for decades. For example, COBOL has the REDEFINES clause which does exactly the same thing. It is possible that this feature was added to C in part to enable a C program to interact with software and data from other computer languages. 30 years ago, I was involved in an EDI project (Electronic Data Interchange) involving the receipt of purchase orders and the sending of invoices via X.500. One computer was an IBM mainframe, the other was a Unix minicomputer. X.500 was an expensive protocol, so EDI was designed to send the required data in the smallest packets possible, and made heavy use of REDEFINES and unions to accomplish this.

  • @phraggers
    @phraggers 4 месяца назад

    I usually use unions so I can access certain struct members as either their variable name or their array index:
    union { type name[3]; type name1,name2,name3;};
    useful for things like gamepad buttons so I can loop over all buttons but still use their individual names (of course the same could probably be achieved with defines, or by using a pointer, but after the optimizer has done its thing who knows the packed order without messing with pragma pack) and I've used them in random number generators too to overlap bytes and create some wacky seeds for RNG

  • @crazychicken0378
    @crazychicken0378 Год назад +1

    Something I love to do with unions is treating a multidimensional array as a single linear array for quick one off tasks that would only require a simple for loop instead of a set of nested for loops. Makes rereading my code easier and thinking a lot easier too

    • @Songfugel
      @Songfugel Год назад

      Depending on the use case, this might come with a massive performance hit since it bypasses some cpu and cache optimizations available for dealing with 2D data

    • @user-sl6gn1ss8p
      @user-sl6gn1ss8p Год назад +1

      @@Songfugel do you mean there are optimizations for, say, double[][], vs a double[] with the same number of elements? Can you give me any pointers on that?

    • @Songfugel
      @Songfugel Год назад

      @@user-sl6gn1ss8p Yes, you can search for cpu matrix (that is what 2d arrays can be) optimizations and also how keeping the for loops nested to limit the individual task of the work in the last loop to be in cache limits, the speed and cache optimization of the operations can be much better
      Nested for loops done correctly (you don't branch, and jump out whenever the job is done for that loop) are not bad is some context like 2D number crunching.
      There is an amazing video about it by DepthBuffer on YT called something like "nested loops can make your code faster"
      However, not to mislead, I have to point out that nested branches (like nested IF statements) can be very very bad

  • @adagioleopard6415
    @adagioleopard6415 8 месяцев назад

    Unions are awesome! Had a fun bug with it though, on the ARM you have to specify that it should pack variables, otherwise there are random zeros in the middle

  • @ishi_nomi
    @ishi_nomi Год назад +1

    It is interesting that c can already do so many thing just using struct, while union are pretty restricted to the use case that really make sense. So even if I knew it when I was newbie doing tutorial, I don't ready knew where and how to use it. It is when I once faced a problem that really need union to solve, I really know how to use it.

  • @apmcd47
    @apmcd47 Год назад

    The X intrinsics toolkit (the one Motif is built on) used this trick but with the type identifier being the first member of each struct in the union.

  • @lalpremi
    @lalpremi Год назад

    Interesting, thank you for your introductions to Unions. I will experiment with it and understand it 100%

  • @lukegary4482
    @lukegary4482 5 месяцев назад

    Using a union to alias an anonymous bitfield struct with a byte array and a dword is a great trick to deal with peripheral registers instead of using bitmasks so long as you can guarantee that the struct will be packed properly.

  • @Julian-mc3tc
    @Julian-mc3tc Год назад

    I worked on a machine code excutor for a school project.
    Union saved my butt for "converting" an unsigned short into a array of unisgned char

  • @greenwool4460
    @greenwool4460 10 месяцев назад

    Awesome vid! I remember when my prof was talking about unions I never really got it. I can’t believe it took me this long to actually learn it for real lol

  • @peterjansen4826
    @peterjansen4826 Год назад

    One of the better RUclips-teachers. In this case it is quite simple but explaining it at a fast speed without neglecting details is an art.

  • @franciscoflamenco
    @franciscoflamenco Год назад

    I used unions a lot in my previous job.
    The system's learning data would be flashed into ram all at once, and then we'd break it down into smaller and smaller structs depending on the "module" within the system.
    Because of legacy code, some of the structs could have slightly different type definitions across the code base. Using unions is a slightly more structured way of accessing that memory, as opposed to casting void pointers into the type that you're expecting.

  • @platinummyrr
    @platinummyrr 14 дней назад

    the biggest problem with unions in my opinion is that the best or most useful aspects are lumped into undefined behavior. Accessing the union with a type other than what you stored it is undefined, so some compilers can end up doing tricky things. In particular, multiple fields of a struct in a union may not be laid out how you expect when accessing the union from another type, especially if you want to do things with bitfields in order to have bit level precision of a register, for example. This makes writing portable code more difficult, though if you know your compiler and target platforms, you can do some pretty nice things with unions.

  • @adagioleopard6415
    @adagioleopard6415 8 месяцев назад

    Unions are super usefull for communications.
    You can save the CRC and Opcode and a union with the payload. Then depending on what the opcode is you can read the union in different ways.

  • @zxuiji
    @zxuiji Год назад

    0:41, should be putting floating point numbers 1st in unions, had a compiler complain at me before when I didn't

  • @cainabel2553
    @cainabel2553 3 месяца назад

    Case 3 is called in sum type in ML language like SML or Caml or O'Caml.
    The possible values are the union of the values of the different member types but... no tag is automatically included in C/C++. In ML languages, the tagging is automatic ensuring safe execution.

  • @nick15684
    @nick15684 Год назад +1

    I've used unions for creating generic types in my program and serialization.

  • @longbranch4493
    @longbranch4493 Год назад

    1:00 Yeah, I had been thinking that the structure size was the sum of the sizes of its members. But when I tried to access the members directly by shifting the pointer, it didn't work properly. Turned out there is a thing called padding, some additional unused bytes that the compiler adds to align (whatever that means) the structure in RAM. It is especially essential when you work with files that contain binary data that needs to be mapped with structures. So if you want your structure size to be actually equal to the size of its members, you need to mark the structure as __packed__.

  • @darkobakula5190
    @darkobakula5190 Год назад

    I consider union an early precursor to polymorphism with capability of adding simple RTTI concept by wrapping union and enum together into a struct, many people use unions this way. It's also the easiest and fastest way to understand polymorphism and it's benefits.

  • @Ma_X64
    @Ma_X64 Год назад

    I'm always using them programming for MCUs. Sometimes I need some tricky data rearranges and unions helps me.

  • @paper_cut9457
    @paper_cut9457 Год назад +2

    I knew about unions but try to avoid them due to the fact that they require extra care in thinking how you are phisically organizing memory in your code (think of an array of unions, for example). In my opinion, another hidden gem in C is the "reserve" keyword. Thanks a lot for the video, high quality as usual !

    • @rz2374
      @rz2374 Год назад

      reserve isn't a keyword in c. did you mean register?

    • @naomicoffman1315
      @naomicoffman1315 Год назад

      @@rz2374 The description of a "hidden gem" made me think of restrict.

    • @polarpenguin3
      @polarpenguin3 Год назад +1

      ​@@rz2374Hopefully not because register is mostly deprecated and the compiler will usually ignore it.

    • @hallrules
      @hallrules Год назад

      @@rz2374 he probably meant restrict

    • @itellyouforfree7238
      @itellyouforfree7238 Год назад

      I think you mean "restrict", not "reserve"

  • @RandomGeometryDashStuff
    @RandomGeometryDashStuff Год назад +1

    03:20 will this work different big endian?

  • @johnshaw6702
    @johnshaw6702 Год назад

    A very good explaination of unions with nice examples.
    I do wish to point out that, unless you pack a structure, the size will surprise a few people. Structures, arrays, and types are usualy aligned to word (register size) boundries, which are implementation dependent.
    I once found a bug that wasn't, because the array, containing a string, was automatically aligned. The character array was declared to be 10 and contained 10 characters. The issues was that it reprented a string, which is suppose to be null terminated. The complier actually aligned the array to 12 bytes, not 10, so it worked because the last 2 bytes were 0. Modifying it to be UNICODE compatible made it blow up, because 10 UNICODE characters were aligned and there was no extra bytes to hide the mistake.

  • @tylerhummel236
    @tylerhummel236 11 месяцев назад

    Found out about unions when I was making my final project for CS50x a few days ago. Used one in one of my structs for my fighting sim program.

  • @Ava-x9z3n
    @Ava-x9z3n 5 месяцев назад

    one of my biggest worrys about learning c was that there would be no polymorphism, this is very relieving

  • @372leonard
    @372leonard Год назад

    there are 2 use cases where i wish i had unions in C#.
    1 with vectors and colors xyzw = rgba
    2 with character hitboxes where the position of the hitbox is shared with the characters position. so character.pos = character.hitbox.pos

  • @himonkoch3416
    @himonkoch3416 Год назад +3

    As an embedded developer, unions are SOOO OP it's crazt.

  • @sinom
    @sinom Год назад

    In general union is basically only safe to use when you use it like the ip example. The problem is you can just access the data in the union through all its possible representations at any time, even if it's nonsense. This is extremely unsafe and can cause a lot of issues that compilers etc. can't help you find. So if you're using unions in a different way it's usually best to abstract away the details into some functions that handle it for the user, and never let the user actually touch them.

  • @dekutree64
    @dekutree64 Год назад

    I use them to create a bunch of aliases for the members of vector and matrix types.
    union{struct{float x,y,z;}; float v[3];} Vector3;
    union{float m[9]; Vector3 row[3];} Matrix33;
    union{float m[12]; Vector3 row[4]; struct{Matrix33 mtx33; Vector3 translation;};} Matrix43;
    Very nice when you need to pass a portion of a composed struct as an argument to a function, or access elements with a loop iterator instead of by individual names. The only drawback is that it clutters the debug watch window.

  • @TheGrimravager
    @TheGrimravager Год назад +1

    I have absolutely heard of and implemented unions in C before, the fact that this video will not stop showing up on my youtube really annoys me. I came to comment for 2 reasons:
    1. This is the only video that bothers me, I love your other content

  • @Anubis1101
    @Anubis1101 Год назад

    i use unions to break down types like floats and doubles for complex conversion, as well as dynamically access arrays and heaped memory. i think its great practice for any budding programmer to learn to use them, so its sad to hear theyre relatively obscure.
    theyre super useful even in C++ (even though much of their implementation is UB), much faster than a lot of other included functions and features.

  • @jpudel
    @jpudel 3 месяца назад

    I used union for a simple cli ui system where I create a struct for example button or checkbox and those are un a union which then is contained inside a uielement struct. that worked perfect for adding new ui types and simplefied rendering logic

  • @EvilP3arProductions
    @EvilP3arProductions Год назад +1

    They're incredibly useful in game programming since they're the basis of variant data types. Love me a delicious union

  • @alexjoaquimpereira7671
    @alexjoaquimpereira7671 Год назад

    I had come across unions during my first year engineering class, but there wasn't much focus on it, instead all attention was on structures (due to its use in Data Structures). I always wondered why do we need them, and even most websites' explainations of it being useful for type conversion felt silly as I can do it normally using format specifiers. But this is the first time I found the actual practical usage of them. Especially the polymorphism part, which can be used during some DS experiments involving conversion between postfix, prefix, infix operations.

  • @bresent
    @bresent Год назад +31

    Literally every website that teaches C teach unions

    • @Zzznmop
      @Zzznmop Год назад +3

      I would also argue that most explanations are either lacking or overly verbose - this vid was (Goldilocks voice) juuust right :)

    • @Zzznmop
      @Zzznmop Год назад

      @@Finkelfunk did you actually use them in 1st semester at uni? I’d only heard of brief mentions regarding relations to network/packet programming and embedded use cases

  • @salsamancer
    @salsamancer Год назад

    Unions are basically a syntactic sugar to prevent excessive amounts of explicit casting.

  • @vishwanathbondugula4593
    @vishwanathbondugula4593 Год назад

    I always thought who would use a union, but this video opened my eyes wide open with their power.

  • @JoshuaMaciel
    @JoshuaMaciel Год назад

    This was great timing after just learning about Unions in Zig while doing Ziglings last night

  • @AmirHosseinHonardust
    @AmirHosseinHonardust Год назад

    So what is their difference with rust's enums? I'm wondering if there is a huge difference, considering that we absolutely love them over here and use it basically everywhere, i absolutely count it as one of the biggest selling points of Rust, but then it is not just an obscure thing in C, but also it is counted by some to be an anti pattern at the same tier as goto?

  • @grimvian
    @grimvian 11 месяцев назад

    Thanks for your insightful information about C.
    Because of my bad hearing I struggle, because of the background music...

  • @assimilater-quicktips
    @assimilater-quicktips Год назад

    Really well presented video. I really like unions, have used them in some shape or form for most of my embedded programming, although I have a lot of stuff that can’t use them as cleanly as I would like because I have to worry endianness, but that’s just how it goes

  • @alexaneals8194
    @alexaneals8194 Год назад

    The only time that I have used unions was to overlay registers. So, I could access EAX or AX depending on whether I needed the 16 bit or 32 bit register, but that was a long time ago and it was only for a hobby project.

  • @YannGREDT-nh8et
    @YannGREDT-nh8et Год назад

    thanks for the tips !
    I also use struct of function to make some kind near C++ with C

  • @ProfRoxas
    @ProfRoxas Год назад

    I haven't used C in a while but union is probably my favourite feature.
    But i'm not sure anymore, but i think "the sum of the size of these elements" might slightly be incorrect because of padding?
    Maybe it was the alignment where different size of the members can cause performance benefits because of how they are in the memory?
    In the example it fits well, but maybe if the first element was smaller than the second, the sizeof would be bigger because of the padding?

  • @QuikRay
    @QuikRay Год назад

    I was doing this union/structure stuff 20 years ago....so powerful...especiall as an embedded C programmer....picking out specific bits of a byte...setting/ clearing etc.

  • @rafa_br34
    @rafa_br34 Год назад

    One way I used unions a while ago was to interpret 32-bit color both as an uint8 array and 4 named 8-bit uints using:
    union RGBA32 {
    uint8 RGBA[4];
    struct {
    uint8 R, G, B, A;
    };
    };

  • @rursus8354
    @rursus8354 Год назад

    Probably? You've heard wrong. I used them for storing whatever in linked lists, using a constant to determine how to treat the whatevers. You can do it type safely if you design a small library that only access one link through that library.

  • @ducgia1493
    @ducgia1493 11 месяцев назад

    It helps to deal with type diversity as well

  • @liam1253
    @liam1253 Год назад

    I have used unions in the past for MIDI data to be able to efficiently break apart each byte in the message

  • @redcrafterlppa303
    @redcrafterlppa303 Год назад +1

    I think Union based polimorphism is a really great system. It's sad that there isn't any larger language that has 1st class support for them. Sure rust enums are type safe unions, but the polimorphism strategy isn't first class.
    My idea is to treat polimorphic types as a union of its varients and implement opaque polimorphism using unions.

    • @dynfoxx
      @dynfoxx Год назад

      If you dont know about it look at enum_dispatch

  • @MotorBorg
    @MotorBorg Год назад

    No experience in c, but I've used a similar feature of Pascal called variant records (if I remember correctly) to access individual words or bytes of an integer.

    • @ronald3836
      @ronald3836 4 месяца назад +1

      Haha, I was just trying to remember what they were called in Pascal and stumbled on your comment.