How one thread listens to many sockets with select in C.

Поделиться
HTML-код
  • Опубликовано: 17 фев 2020
  • Patreon ➤ / jacobsorber
    Courses ➤ jacobsorber.thinkific.com
    Website ➤ www.jacobsorber.com
    ---
    How one thread listens to many sockets with select in C. // Ever wondered how to concurrently monitor multiple network connections (sockets) without creating multiple threads? This video shows you how with select(), a universally-available call that so many programmers don't know exists. This is a network-oriented video that shows how select works in a socket server example, written in C, but this technique works with any file descriptor, including files and pipes.
    This is also the first of what I expect to be a series of videos on asynchronous I/O and event-driven systems.
    Related videos (playlist):
    • Network Programming
    ***
    Welcome! I post videos that help you learn to program and become a more confident software developer. I cover beginner-to-advanced systems topics ranging from network programming, threads, processes, operating systems, embedded systems and others. My goal is to help you get under-the-hood and better understand how computers work and how you can use them to become stronger students and more capable professional developers.
    About me: I'm a computer scientist, electrical engineer, researcher, and teacher. I specialize in embedded systems, mobile computing, sensor networks, and the Internet of Things. I teach systems and networking courses at Clemson University, where I also lead the PERSIST research lab.
    More about me and what I do:
    www.jacobsorber.com
    people.cs.clemson.edu/~jsorber/
    persist.cs.clemson.edu/
    To Support the Channel:
    + like, subscribe, spread the word
    + contribute via Patreon --- [ / jacobsorber ]
    + rep the channel with nerdy merch --- [teespring.com/stores/jacob-so...]
    Source code is also available to Patreon supporters. --- [jsorber-youtube-source.heroku...]

Комментарии • 110

  • @athisii_ekhe6167
    @athisii_ekhe6167 4 года назад +7

    Nice! Beautifully explained. Please continue on select() to handle multiple clients and how to read() multiple times.

  • @thisismeanas
    @thisismeanas 2 года назад +9

    Thanks! After 3 days debugging my program, your explanation captures beautifully how the destructive select() method works beyond beej and my class lecture combined. Submitted the assignment with 100/100 on 3-way handshake

    • @5tarstriker78.
      @5tarstriker78. 7 месяцев назад

      Where did you go to school? My school also teaches with beej’s books

  • @NeerajSharma-oz1mm
    @NeerajSharma-oz1mm 3 года назад +1

    Love watching your videos because you're an expert on everything you talk about

  • @baruchben-david4196
    @baruchben-david4196 3 года назад +2

    I really like your videos. They are short and informative. You get right to the point.

  • @unperrier5998
    @unperrier5998 2 года назад +28

    A note that Jacob didn't address: select() is slow. It takes linear time with the amount of sockets to monitor. That's a limitation of the kernel, so there's nothing we can do about it.
    That's why there are more modern replacements (epoll, kqueues) which are much more efficient, in addition to offer more events than select does.

    • @unperrier5998
      @unperrier5998 2 года назад +3

      @Demitri Swan well, no he didn't touch about select() performance problem. Jacob merely said that the number of file descriptors increase, not that the function call to select() (the syscall) is slow to run in kernel mode, which is where performance are hit.
      A larger number of fd in userspace is a small problem because it's manageable.

    • @unperrier5998
      @unperrier5998 2 года назад +1

      @Demitri Swan you're confused.
      The problem with select is twofold:
      1) the biggest problem is the linear time and resources it takes in the kernel
      2) the smaller one is the array that needs to be iterated.
      The 1st one is where the crux of the slowdown lies, the second one is minor, especially because it can be somewhat circumvented in software (likeJacob did)
      Jacob only speaks about the second one, not the first one.

    • @unperrier5998
      @unperrier5998 2 года назад +1

      @Demitri Swan No offense buddy, but you're the one who argued with me without having enough knowledge. Good luck.

    • @eivind261093
      @eivind261093 2 года назад +3

      @@unperrier5998 You're arguing for no reason. One is a consequence of the other. The problem you're describing is specifically mentioned in the video; he mentions it, but doesn't go into detail about the exact consequences. Maybe, I don't know, just maybe, this might have something to do with the fact that this is a video targeted at people who are learning to program with C, rather than those who already know everything.

    • @unperrier5998
      @unperrier5998 2 года назад +4

      @@eivind261093 No they are two separate problems because they act on different data: list on FD in the kernel, and another list of FD in userspace.
      They;re both the consequence of having too many files. But they're not linked in any way. The proof is that in userspace you can get away with the heuristics (hack) that Jacob implemented... but in the kernel you can't do anything about the O(N) behaviour.

  • @batman100572
    @batman100572 2 года назад

    Very nicely explained.

  • @pavangm6867
    @pavangm6867 3 года назад +4

    Your videos are really good!! 👍 when can I see the improvements on this server( I mean event driven programming and asynchronous I/O ? 🙂)

  • @axlslak
    @axlslak 4 года назад +1

    this is in the realm of personal lessons. kids... u better pray you have a teacher this good. u sir are awesum.

  • @esben181
    @esben181 4 года назад +2

    I have made the switch to linux so now i am watching all of your videos again

    • @JacobSorber
      @JacobSorber  4 года назад

      That what they're there for. I hope the switch goes well.

  • @danielkrajnik3817
    @danielkrajnik3817 3 года назад +2

    who could have possibly disliked this gem?

  • @kawkab833
    @kawkab833 Год назад

    very instructive and precise, thank you !

  • @diconicabastion5790
    @diconicabastion5790 2 года назад

    I'd like to see the future videos you mentioned making in this one.

  • @EdwinFairchild
    @EdwinFairchild 2 года назад +1

    thank you for this, was able to use it to monitor serial ports and not hang on a read function.

  • @clumbo1567
    @clumbo1567 2 года назад +2

    Love this tutorial, Would be nice to get an update on this video, you did day you would do improvements in future videos :D

  • @prashantgaur5123
    @prashantgaur5123 4 года назад +1

    Hi Jacob ,
    Hope you doing great.
    Do you have any suggestions for understanding Multithreading and its synchronisation issues.
    As I tried mutex with conditional variable for signalling but now I am stuck with scheduling problem.
    Please do suggest some link or book for better understanding of threads ,mutex and semaphores.

  • @eddaly4160
    @eddaly4160 2 года назад +1

    Great video, thank you...do you have any videos on epoll , (configured for edge triggered operation)? Cheers!

  • @phcmaia
    @phcmaia 4 года назад

    Great video !

  • @FightEverything
    @FightEverything 3 года назад

    Looking forward to next video

  • @lordadamson
    @lordadamson 4 года назад +1

    pretty awesome :D thanks

  • @DaneC020
    @DaneC020 2 года назад +1

    Any chance on more networking videos? I really like seeing these type of videos. Not discussed as often as other C/C++ topics.

    • @JacobSorber
      @JacobSorber  2 года назад +1

      Yeah, there's definitely more than a chance. Any specific topics or questions you'd like to see addressed?

    • @DaneC020
      @DaneC020 2 года назад

      I like the fact you are working without APIs and I'm personally interested in some techniques that could handle more than the FDSET max size. while being safe from ddos attacks.

    • @DaneC020
      @DaneC020 2 года назад

      @@JacobSorber I wouldn't mind seeing how to setup a WSS Connection from scratch. Everyone shows how to do things with API's, I like to get low level and I can't seem to find much on the topic.

  • @laminekaba3064
    @laminekaba3064 4 года назад +9

    Hi Jacob
    I think there is an error. The max_socket_so_far in "select" function should be if(select(max_socket_so_far + 1, &ready_sockets, NULL, NULL, NULL) < 0) .
    I'm currently implementing a voice assistant application using PI (school project).
    Thanks for all your efforts you saved me lot of time for handling multiple clients 👍👍👍.

    • @tastyham
      @tastyham 2 года назад +2

      @You Tube admit this ratio

  • @Murlucky
    @Murlucky 3 года назад +6

    Thanks for the video really helped me understand how to use select.
    I think you made a small error though: select(max_socket_so_far, ...); -> select(max_socket_so_far +1, ...);
    (If you have set two file descriptors "4" and "17", nfds should not be "2", but rather "17 + 1" or "18".) ~man 2 select

  • @oliviam830
    @oliviam830 3 года назад

    Is it possible to use select() with UDP socket? I am trying to use one server to listen to 2 different ports - i.e. 2 sockets with the same IP but different ports. I have UDP client sockets sending messages to each of the ports but I can't seem to read anything from both ports from my UDP server socket, but I can read from a single port when I removed the code for the other.

  • @blameItleaveit
    @blameItleaveit Год назад +1

    I have a humble request for you, it will be really awesome if we can have a Playlist from beginner to advance level or a paid course in C. Please create a course on C 🙏🙏

  • @ttc0419
    @ttc0419 3 года назад +1

    Hi, could you make a video about some newer apis like epoll and io_uring? Thanks.

  • @ajilraju355
    @ajilraju355 4 года назад +3

    Hey, Sieber, Could you please create video about "static" in C, importance, usage... That is very helpful for beginners.

  • @ElGrilledCheezus
    @ElGrilledCheezus 3 года назад

    Thank you for walking me through my networking class haha

  • @Mishiman
    @Mishiman 4 года назад +8

    Hey, surprised to see you using select() instead of poll() and not even mentioning poll() and its advantages over select()
    As you can see select is not very scalable (and not only because of the fd limit), also, its API is a bit cumbersome. It's true that select is widely implemented, but nowadays you'll be able to use poll() on pretty much all machines as well, and if it's a Linux machine then chances are you can use improved versions of poll. Overall poll gets the same job done better and easier. Many argue that select should not be used at all as it's outdated.
    Descriptor multiplexers aside, I'd like to thank you for your multithreading and socket videos, they helped me tremendously while I was building a server in C for my uni project. The project was such a success I was asked to hold a lecture and talk about it

    • @JacobSorber
      @JacobSorber  4 года назад +8

      Thanks! Glad you're finding the videos helpful. Yeah, poll/kqueue/epoll are all on the docket for future videos. I just started with select, since it's the old standard. More to come.

    • @dinnerandlunchchildchild7994
      @dinnerandlunchchildchild7994 3 года назад +1

      @@JacobSorber Thanks, really looking forward for epoll tutorial.

    • @questionmarc8
      @questionmarc8 8 месяцев назад

      @@JacobSorberNote that poll() has been standardized by POSIX since 2008 but epoll() and kqueue() are still not standardized by POSIX to this day

  • @Dizzynuts-cj8zc
    @Dizzynuts-cj8zc 26 дней назад

    Love watching your videos... However could you dig deeper on the functionality differences between poll() and select().

  • @joseortiz_io
    @joseortiz_io 4 года назад +15

    Would love to see the continuation of this. Perhaps using epoll??

    • @dzious
      @dzious 3 года назад +2

      I would love to see this also. The major problem I get while using poll/epoll is that example we can find constantly have only predefined MAX_EVENTS or whatever they call. Which is the max connected clients at the same time.
      Jacob, if you read this and have time to do so, would it be possible to make it with variable max clients connection ?
      Thanks anyway if you make a new video about network programming.

  • @mihirluthra7762
    @mihirluthra7762 4 года назад +1

    hey great! I wanted to read about select(2)/kqeue(2) but just lazy to go through docs. Work made simple and fast!

  • @boxedowl
    @boxedowl 3 года назад

    MOAR!!!

  • @MohamedAhmed-ii4mr
    @MohamedAhmed-ii4mr 4 года назад +1

    Hello Jacob, I have a question outside of the video topic. In your past videos you posted an email to send you code for you to review and comment on. Is that offer still available, I would love to hear your comments.

  • @thegt
    @thegt 8 месяцев назад

    The intro music woke up some people in my house.

  • @Youtube_Stole_My_Handle_Too
    @Youtube_Stole_My_Handle_Too Год назад

    0:56 Is it really necessary to test both if A is zero and not zero?

  • @jace4002
    @jace4002 4 года назад +1

    Thanks!
    Ever considered showing how a dumb/small/simple(I know the useful ones aren't) database back-end works in C? I'd really love to get a feel for what happens "behind the scenes" of SQL-calls and all that. You know, without going through at least a quarter of a million lines of source code...

    • @JacobSorber
      @JacobSorber  4 года назад +1

      I'm not 100% sure what you're asking. Are you asking for a dumb/small/simple database implementation (like with an SQL interpreter)? Or are you looking for a web-app back-end written in C that accesses a database?

    • @jace4002
      @jace4002 4 года назад

      @@JacobSorber The first one. Exactly as you described.

    • @JacobSorber
      @JacobSorber  4 года назад +1

      @@jace4002 I'll think about it. It wouldn't be a million lines of code, but that would still be a SERIOUS tutorial. Maybe if we used a subset of the language, and made it into a multi-video series, including a part about parser generators (like YACC)...I'll see what I can do. :)

    • @jace4002
      @jace4002 4 года назад

      @@JacobSorber That's all I can ask. Thanks!

  • @stealth463
    @stealth463 3 года назад

    Is there any way to find destination port number of incoming UDP packet from a socket ?
    Thanks.

    • @anthonynjoroge5780
      @anthonynjoroge5780 2 года назад

      Yeah.
      Using the getsock() syscall.
      Check out its man page for details.

  • @zxuiji
    @zxuiji 2 года назад

    I'm assuming the future "improvements" will make use of poll instead since it sounds like select just uses that under the hood

  • @m4l490n
    @m4l490n 4 года назад

    Besides portability, especially because I intend to run my server socket in a Linux server, what would be a good reason to use "select"? I ask because I'm looking at "poll" and it's simpler to use.

    • @JacobSorber
      @JacobSorber  4 года назад +1

      poll and select are almost identical (just slightly different interfaces). In fact, in some systems, poll is just a wrapper around select. I started with select because it's the old standard, and I hope to address the newer alternatives as I can get to it.

    • @shushens
      @shushens 4 года назад

      In my experience, in newer systems, select is a wrapper around poll. This is also true in Windows. This is done for the obvious reason of preserving backward compatibility while improving performance.
      The original implementation of select is too old to still be included in modern libc flavours anyway.

    • @riteshkakkar7112
      @riteshkakkar7112 2 года назад

      Pakistann

  • @akashsaxena9252
    @akashsaxena9252 Год назад

    please share the source code link too in description section..

  • @adamlevy8971
    @adamlevy8971 3 года назад

    What files did you #include in the header?

    • @jeffreymarc5
      @jeffreymarc5 3 года назад

      Did you get the answer for this??

  • @legitjimmyjaylight8409
    @legitjimmyjaylight8409 3 года назад

    I cannot get this example working. It keeps seg faulting before main even starts.

  • @Abhinavkumar-lj9uy
    @Abhinavkumar-lj9uy 4 года назад

    Can you please create a video explanation on D-BUS in LINUX

    • @JacobSorber
      @JacobSorber  4 года назад +1

      I'll add it to the list and see what I can do.

  • @DoronOverflow
    @DoronOverflow Год назад

    some errors: first param of select "The number of socket descriptors to be checked", so select(max_socket_so_far + 1...) and in for loop: for(int i =0; i

  • @hectordoyle4718
    @hectordoyle4718 4 года назад +1

    Hello there, Jacob!
    One simple question: what if two clients were to send a message at the very same time?
    Is select() invulnerable for this kind of problem? Because threads are not. Or are they?
    Kind regards!

    • @dorquemadagaming3938
      @dorquemadagaming3938 4 года назад +4

      The description is a bit ambiguous - do you mean two separate connections to the clients (as in TCP or any other connection-oriented protocol), or a connection-less sending to the same socket (as in UDP)?
      Case 1: two connections means two separate file descriptors - they both will be marked as readable, and after doing select()/poll() you choose which order you go to read them - most likely you'd loop through all descriptors from 0 to N and check which ones are readable so the lower FD number probably gets it first.
      Case 2 (two datagrams/packets arriving at the same connection-less socket): the messages will be placed in the kernel's internal queue leading to the file descriptor and it will become readable. You may choose to read only one packet and do select() again - it will instantly return with the same FD marked readable and you can read it again to get the second packet. OR: you can read the first packet and then attempt to read the same FD again and again to read all the packets accumulated in the queue until your recv() returns -1 and errno=EAGAIN which means no more data is available. Then you can return to your main event loop and try select() again. BTW same goes for the stream sockets (TCP) - your first read might return only a part of the available data, so you can try reading again to get the rest.
      Whatever order the packets will be placed in the queue - that's the task of the kernel, it has it's own threads and does proper synchronization between them so the data in the same queue won't get garbled if something truly arrives at the same time.

    • @kathiravankathir3089
      @kathiravankathir3089 3 года назад

      @@dorquemadagaming3938 nice explanation. I got a weird scenario in where the TCP socket is reported "read ready" by select() though the data is read already. I meant select reports the socket ready multiple time in a loop even if it is read already. This behaviour is rectified by setting the socket flag as O_NONBLOCKING using fcntl(). Do u have any ideas to spare?

    • @dorquemadagaming3938
      @dorquemadagaming3938 3 года назад +1

      @@kathiravankathir3089 One possibility, and a very frequent one: the peer has sent some data and then shut down the connection on its side - i.e. it has nothing more to say. So, after reading all the data on your side, the socket will become readable again, but if you try to read, you receive 0 bytes - that means EOF, no more data will come. OS separates these events, to make it possible to distinguish them for the usermode application. What do your subsequent reads return as result and errno? If you don't close the connection it will keep signaling you EOF on the file descriptor.

    • @dorquemadagaming3938
      @dorquemadagaming3938 3 года назад +1

      By the way, shutting down the connection is not the same as closing it, as it can be done unidirectionally. Read on shutdown() vs. close() functions. This is a typical behavior of the one-request HTTP clients (e.g. REST-clients) - they send one request and then do shutdown(fd, SHUT_WR) on their socket - this way the server receives an EOF notification - no more requests expected, but it still can send the reply, because the server-to-client direction is still open.

    • @riteshkakkar7112
      @riteshkakkar7112 2 года назад

      Pakistannn

  • @user-zt7je7ke1r
    @user-zt7je7ke1r Год назад

    Make a video on poll and epoll :)

  • @arden8133
    @arden8133 3 года назад

    When do you close the serverside listening file descriptor, this program is currently leaking resources
    on second thought, you'd most likely close the listening file descriptor after the server is finished servicing the client..

  • @artemrusinov3034
    @artemrusinov3034 2 года назад

    is epoll better ?

  • @Davtd.
    @Davtd. Месяц назад

    couldn't we store our clients fds into a dynamic array and iterate over that instead of FD_SETSIZE?

  • @minimoon300
    @minimoon300 4 года назад

    help me clarify something .... is his shirt blue or green ? (don't tell me turquoise)

  • @John-nb7bw
    @John-nb7bw 4 года назад +1

    shoutout cse130!

    • @JacobSorber
      @JacobSorber  4 года назад

      Where at? UCSD?

    • @John-nb7bw
      @John-nb7bw 4 года назад +1

      Jacob Sorber no at ucsc, we’re making a load balancer

  • @drywater3729
    @drywater3729 3 года назад +1

    CSC209 A4 GANG

  • @nexovec
    @nexovec 3 года назад

    hmmm... hmm hmm hmm hmmmm... hmm hmm hmm hmmmm

  • @riteshkakkar7112
    @riteshkakkar7112 2 года назад

    Pakistann

  • @cd1168
    @cd1168 7 месяцев назад

    Why would you charge money on Patreon for a simple college class socket C program ?