EVERYONE Needs to Learn a Little Bit of AWK!

Поделиться
HTML-код
  • Опубликовано: 31 май 2024
  • AWK is a brilliant scripting language for processing text files, comma separated values, and text databases. Deal 60% off - Machine Learning & Data Science Bundle: andauth.co/MachineLearningBun...
    If you find yourself creating spreadsheet to process rows of data or numbers then you MUST learn some AWK!
    ####
    Machine Learning & Data Science Bundle - andauth.co/MachineLearningBun...
    ####
    Introduction to Android app development: www.dgitacademy.com
    Let Me Explain T-shirt: teespring.com/gary-explains-l...
    Twitter: / garyexplains
    Instagram: / garyexplains
    #garyexplains

Комментарии • 1,1 тыс.

  • @nickiebanchou
    @nickiebanchou 4 года назад +939

    "forget spreadsheets and excel"
    crowd: ooh?
    "use command line!!"
    crowd : oh...

    • @1MinuteFlipDoc
      @1MinuteFlipDoc 4 года назад +43

      yulp! LOL Python (heavy duty jobs) > Excel > AWK.
      AWK is a legacy tool. there's a reason people don't use it. hahahah

    • @tanithrosenbaum
      @tanithrosenbaum 4 года назад +122

      @@1MinuteFlipDoc Except for those people who do use it. There's nothing legacy about it, it's just different from excel. And while that obviously doesn't include you (and that's fine), there are lots of people who prefer to write a short script on the command line over clicking through excel dialogs for 2 hours. For conditioning data to use it in data science and number crunching, awk gives you an amazing amount of productivity much quicker than excel does. If you still prefer excel that's fine though, your choice.

    • @andiriki
      @andiriki 4 года назад +3

      nickie banchou thanks you save me Time.

    • @JohnR31415
      @JohnR31415 4 года назад +40

      1MinuteFlipDoc awk is very powerful, use it pretty much every day. Can’t use excel in pipe chains...

    • @mrclaytron
      @mrclaytron 4 года назад +36

      @@1MinuteFlipDoc no, people do use it all the time! Plus, it fits perfectly with the Unix philosophy of piping data from one small terminal program to the next. It's been around since the 1970s, but that certainly doesn't mean it's legacy - it's still an amazingly powerful tool that people still choose to use.

  • @michaelburns8073
    @michaelburns8073 4 года назад +39

    If you like using Awk to do stuff like this, there are a few other cool Unix tools you should have a look at. Split, join, uniq will allow you to do DB-like things on text files. Another is tr. Of course sed, cut, grep as well. I took a grad course in which we had to create software development tools only using Unix tools. That class was both illuminating and fascinating. Learned a lot that I still use to this day.

    • @99corncob
      @99corncob 2 года назад

      A lot of us learned to process tables of data using all these tools before spreadsheets were invented. Welcome to the club.

    • @CARPB147
      @CARPB147 2 года назад

      Oh yes. Think of AWK as SQL for text files. You can SELECT rows, and columns (words separated by whatever delimiter applies), and even declare "WHERE" (conditions) using regular expressions and/or boolean operators. Your input could be log files, emails, or whatever you have on text (like text stuff on chats). It could be source code of any programming language if you are interested in gathering quality metrics on them. Your imagination is the limit.

    • @yashwanth.chaudhari339
      @yashwanth.chaudhari339 10 месяцев назад

      CAN YOU share ur knowledge?

    • @egorsozonov7425
      @egorsozonov7425 9 месяцев назад +1

      Yes, just like you said, "db-like". This is reinventing SQL with a bunch of half-assed incoherent unorthogonal tools barely cobbled together. It's simpler to just use a real DB like Sqlite.

    • @drwombat
      @drwombat 7 месяцев назад

      As previously asked can you share your program you made? Or anything like it?

  • @SEX_ON_DRUGS
    @SEX_ON_DRUGS 4 года назад +291

    Gary I will never forgive you for teaching everyone my only impressive linux skill

    • @senantiasa
      @senantiasa 4 года назад +10

      LOL, oh come one... the FOSS world is about sharing.
      But I completely understand what you're saying. It's also my only Linux skill that no one I know have the slightest understanding, yet they call me in for it...

    • @satibel
      @satibel 3 года назад +1

      regex-plosion!

    • @mikicerise6250
      @mikicerise6250 3 года назад

      Thanks, Gary. ;)

    • @ylc870
      @ylc870 3 года назад +5

      Well those who really want learn this still have a bunch of documentation to read... No worries

    • @zonumev
      @zonumev 2 года назад

      And he did it so effortlessly, made it look easy ... the mark of an expert! I'm happy to know that AWK ain't a dead skill 'cause I still rely on it.

  • @carndt124
    @carndt124 2 года назад +17

    Used AWK in 70's and 80's. Had an accounting system written in AWK. Also, had an AWK to C compiler, for the real hardcore number crunchers.

    • @andresmusetti
      @andresmusetti 2 года назад +1

      Now that's a new level of awesomeness

    • @Prem-jd8li
      @Prem-jd8li 2 года назад

      Very nice!

    • @mal-avcisi9783
      @mal-avcisi9783 2 года назад +1

      i used AWK in 1941 in ww2 to decrypt enigma. was fun time

  • @meowcula
    @meowcula 2 года назад +7

    awk is one of my favourite tools, not because I'm any good at using it, mind, but just because of its history. It's from 1977! The basic apps of our unix/linux ecosystem have a rich history.

  • @captainz9
    @captainz9 3 года назад +14

    Awk and grep were the heart of many scripts I've written over the years.

  • @prettylitch1785
    @prettylitch1785 2 года назад +12

    00:00 Intro
    01:15 more show data in file
    01:29 wc -l show number of lines in data
    01:48 awk '{print}' prints file
    02:24 awk '{print $0}' $0 prints every line, $1 prints first field ect.
    03:10 awk '/gcc/ {print $1}' match gcc print file name
    03:41 awk '/^w/ {print $1}' Lines start with a double-u
    04:23 awk '/path/ {print $1,$2}' Lines start with path, include multiple lines
    04:41 awk '/path/ {print $1,$2/1024}' can divide output

  • @jimsteele9261
    @jimsteele9261 2 года назад +15

    Back when I was contracting for EDS one of the sys admins handed me a book on AWK and asked me if I could figure out a way to extract billing info from the processing logs. I was hooked. :-) One of the handy features was using strings as array subscripts, and having "sparse" arrays, where only the accessed elements existed. Eventually, I had most of my production automated with AWK scripts.

  • @ripwolfe
    @ripwolfe 3 года назад +3

    I used awk to do data analysis of a large database that output data as a .csv. The company hired an "analyst" (really, he just ran DB scripts) to put the data into Excel, format it and run a few math functions on it. It took 3 hours to complete their way. The awk way took less than 15 minutes.

    • @lacs83
      @lacs83 3 года назад

      Una historia de exito en el uso de AWK

    • @CARPB147
      @CARPB147 2 года назад

      Excel is good at many things, but doing what that analyst did was shstoopid... "A hammer thinks that every problem is a nail."

    • @d3vilscry666
      @d3vilscry666 3 месяца назад

      Was it accurate tho?

  • @zeke6099
    @zeke6099 4 года назад +8

    I've tried reading more than one awk intro and I've never made it very far. You've successfully taught me way more than all of them, thank you!

  • @RoryIsNotACabbage
    @RoryIsNotACabbage 2 года назад +1

    Man I've watched so many awk videos and this has been the only one that has said "this is why you do it" rather than just do the thing
    I don't know how people expect you to learn when they don't explain what you're doing

  • @rondamon4408
    @rondamon4408 4 года назад +310

    AWK? I thought I was the only dinosaur in this world which still uses AWK. Glad to know that I'm not the only one.

    • @wv1138
      @wv1138 4 года назад +6

      That guy (or gal) that they don't let out much whips out awk like an old trucker whipping out a snatch block

    • @JohnDavidDunlap
      @JohnDavidDunlap 4 года назад +6

      Use it every day.

    • @soiledhalo2296
      @soiledhalo2296 4 года назад +15

      It's been a while since I've used it. First time I ever had to use it was in the early 2000s at a call centre. Their call detail records were gigabyte size and excel was struggling with it. AWK just crunched through the numbers and spit out the results in less than 5 minutes. Think I used SED initially, but AWK was the answer.

    • @johnnyq4260
      @johnnyq4260 3 года назад +11

      Now if you two will mate, the dinosaurs will not go extinct.

    • @johnmcginnis5201
      @johnmcginnis5201 3 года назад +14

      I can have a log file parsed with awk faster than that young'n with a spreadsheet. And reformatting an address file that has quotes drives excel nuts. In awk you just manipulate the FS constant. I might be a dinosaur but I would point out that the dinosaur ruled the earth for 165m years and birds are still here.

  • @AuctorisVideo
    @AuctorisVideo 4 года назад +4

    Nice video Gary. In case you (or anyone else watching) didn't know - you can simplify your "rounding up the file sizes" example using printf() and a format specifier.
    For example to print the file sizes to one decimal place you could use:
    {printf("%s %.1fKb
    ",$1, $2/1024)}
    Or:
    {printf("%s %.0fKb
    ",$1, $2/1024)}
    To recreate your exmaple & round to the nearest integer…

  • @monikabalint400
    @monikabalint400 3 года назад +9

    I survived my PhD thanks to awk and sed!! Command line rules!

  • @FrankyDesign
    @FrankyDesign 4 года назад +106

    Awk is really amaz...
    Syntax error: Missing ending '}'

    • @ThePsyForce
      @ThePsyForce 3 года назад

      % in vim will skip between matching parentheses

    • @dosmastrify
      @dosmastrify 2 года назад

      Java too

    • @dosmastrify
      @dosmastrify 2 года назад

      Missing bracket here
      (insert bracket)
      Extra bracket here
      (smashes keyboard)

    • @CARPB147
      @CARPB147 2 года назад

      That'll do it.

  • @senantiasa
    @senantiasa 3 года назад +130

    This is the only skill in my job and my previous job where people turn to no one other than me..:*( Getting sentimental here...

    • @paschikshehu7988
      @paschikshehu7988 3 года назад +2

      what do you do?

    • @senantiasa
      @senantiasa 3 года назад +14

      @@paschikshehu7988 Systems engineer, but it's programmers who come to me for this, usually because they need simple parsing or data manipulation which doesn't take a lot of effort. Then, their programs run my AWK script. These programmers know Sed (which is even simpler than AWK), but their case is usually where Sed is too simplistic and using their programming language would be overkill.

    • @Frankx520
      @Frankx520 3 года назад +1

      kaluq system engineer is so cool. I just found a good PID value for my motor speed control using Matlab earlier today.

    • @afonsosalbrecht
      @afonsosalbrecht 2 года назад

      omg, if you are working as software engineer and they can't do it... change jobs. I doubt your are technically growing in this company hahahah

    • @aneeshprasobhan
      @aneeshprasobhan 2 года назад +1

      @@Frankx520 he said systems engineer, not Control Engineer.

  • @NexuJin
    @NexuJin 4 года назад +23

    I uses awk/sed on a daily basis at work. I uses AWK primary to analyze excel(exported to csv) or other data files for audits. That is on Windows! In both MINGW64 or WSL2 Linux.

  • @srirachahero4679
    @srirachahero4679 2 года назад +3

    It's 2:00AM and I'm watching Gary Explain awk... and it was amazing!
    Thank you, Mr. Simms!

  • @BryanChance
    @BryanChance 2 года назад +2

    Awk and sed one of the most useful and powerful text manipulation and formatting tools I ever learned to use.

  • @AllanKobelansky
    @AllanKobelansky 3 года назад +31

    AWK isn’t a number manipulation tool. It’s a text processing tool that can do math. And so much more. Explore GAWK, the Gnu version.
    And use the tools you know, as best as you can to get the job done. And don’t stop learning.

  • @IreshDissanayakaM
    @IreshDissanayakaM 3 года назад +9

    This is the best introduction to awk I have encountered.

  • @nickskywalker2568
    @nickskywalker2568 3 года назад +4

    Best introduction I've ever seen! I've always been kind of reluctant to learn, but knowing inside that I should do it...
    Thanks for the video!

  • @LoneIgadzra
    @LoneIgadzra 2 года назад +2

    For small files there are more user-friendly tools, but awk really shines when you have some huge text file that you need to massage because it is just about as fast as you can get. Really important to know about in those cases, because you could easily be lead down an unnecessarily slow or expensive "big data" path.

  • @huistelefoon5375
    @huistelefoon5375 2 года назад +1

    AWK has been used to create a full parser/tokenizer, and other purposes that are arguably way more complex for its intended use. Using AWK you can also make advantage of pattern matching with regular expressions, and AWK has many other tools for text manipulation. But I think one of the most powerful aspects of AWK is using it as a complimentary Unix tool. Use it together with other Unix command-line utilities with pipelining, not everything has to be done in AWK. For example, you can use AWK to parse out formatted words from a complex text file, and now pipe this data to be processed by another utility.

  • @juzujuzu4555
    @juzujuzu4555 4 года назад +31

    Gary, I love these introductions to Linux/unix commands/software.

  • @stumbling
    @stumbling 3 года назад +4

    Used quite a bit of AWK in my 3rd year physics project. I had hundreds of experimental data files to process and it was a good choice.

  • @SimonJM
    @SimonJM 2 года назад +1

    My ex-colleagues used to hate me writing awk scripts! Brilliant little language. One happy use was to take the output from informix commands to detail table descriptions and create an output shell script to recreate the database for disaster recovery purposes.

    • @CARPB147
      @CARPB147 2 года назад

      Perfect application for AWK. Nice.

  • @erectlocution
    @erectlocution 3 года назад +1

    I really appreciate how you build up here. I've done most of this in my tinkering but this adds some context that's really helpful.

  • @aravindhreturns
    @aravindhreturns 3 года назад +7

    WHOA! As the class went on, My eyes only widened. Thank you Gary! Much love

  • @aseerose5684
    @aseerose5684 4 года назад +3

    I have used it to generate a useful database of user information from emails after concatenating the emails for processing. It was not hard to learn and ended up being a very useful multi-purpose tool in addition to its primary mission success. Thanks for an excellent video!

  • @jameschums
    @jameschums 2 года назад +1

    I have been using AWK for 20 years, it rocks!

  • @talbech
    @talbech 4 года назад +2

    Thanks. Brought back some great memories of data manipulation of huge point cloud datasets on SGIs. We had to do very similar things before piping data into the OpenGL 3D engine for visualisation purposes. Awk is very flexible and fast and still have many usecases in todays system administration tasks.

  • @inwhichidie7171
    @inwhichidie7171 3 года назад +11

    Just want you to know you saved my ass with this video. Procrastinated on an assignment for my CS class and this really helped me understand some stuff I'd missed and get the assignment done in time. Thanks a ton!

  • @mahmoodabdulla5499
    @mahmoodabdulla5499 3 года назад +4

    This is the best awk tutorial I've seen so far.. please make a video for SED

  • @MrMusic25
    @MrMusic25 3 года назад +1

    Very glad this video popped up on my feed! I've been currently working with data using sed but after watching this i think awk it much more suited for me, especially knowing I can write my own functions that run faster than Bash can! Great video, thanks for the explanation!

  • @merlin-ju6fu
    @merlin-ju6fu 2 года назад +1

    This is the first I have heard of AWK. I am number crunching sports recreational sports handicapper, so AWK might be useful to me.

  • @richa9719
    @richa9719 2 года назад +18

    I used awk in the early 1990’s as a developer. Not sure it is something that a non developer should really use. Good demo, I wrote complex code with Awk to parse flat files.

    • @CARPB147
      @CARPB147 2 года назад

      That is what AWK is perfect for. Great for processing log files or other UNIX/Linux sysadmin stuff.

    • @SteveJones172pilot
      @SteveJones172pilot 2 года назад +4

      Not sure a non-developer should be using?! What's wrong with learning to use a tool? Should a non-sysadmin never open a command prompt because they're not an expert?! How do people become developers then? Any tool like this you can learn to use is a huge plus! Let's encourage experimentation and learning!

    • @CARPB147
      @CARPB147 2 года назад

      @@SteveJones172pilot - Totally agree with your sentiment on this. People will either be interested (or not) in doing this stuff. That will naturally weed out people that program (programmers/developers, etc.) from people that don't. I don't see any reason to have arbitrary mandates against writing AWK commands (or scripts) for people only because they are not *professional* developers. Anyhow, the 40+ year secret is out! 😏😆

    • @CARPB147
      @CARPB147 2 года назад

      Oddly, my very first AWK script was my most complex! Only a page and a half, but it replaced a 40-page SQR program that tried to parse CSV files (ugh) written by a hammer who thought every problem looked like a nail.
      After that I would use it in smaller piped sequences with several AWK commands like:
      awk '{if ( FNR == 1 ){print "FILENAME=",FILENAME}; if (NF && $0 !~ /^--/){print}}' Database/*.sql \
      | awk '/^FILENAME=/{files++}; $0!~/FILENAME=/ && NF {loc++}; END{avgloc=loc/files; printf "%d Database LoC in %d files at avg lines per file = %0.f
      ", loc, files, avgloc}' > $countfile
      (sorry about the look of the run-in line).
      For more complex problems, like ETL cases, etc., I just used Perl which was a natural progression from using Shell + AWK.

  • @hereb4theend
    @hereb4theend 4 года назад +16

    AWK, Sed, Bash, TCL, GREP, Perl and Nvim are my command line friends. 😍

    • @josephdoyle5304
      @josephdoyle5304 3 года назад +4

      Actually, grep was taken out of the line editor ed. The command in ed is
      g/RE/p ( globally search for a regular expression and print the line). Hence "grep RE filename"
      nawk has more capabilties.
      BTW "awk" are the initials for Aho, Wineberger, and Kerinigan, the developers who created awk.

    • @hereb4theend
      @hereb4theend 3 года назад

      @@josephdoyle5304 you're a prince amongst men. 😊

    • @thaddeusolczyk5909
      @thaddeusolczyk5909 3 года назад

      Philistine emacs!

    • @zyan983
      @zyan983 3 года назад

      @@thaddeusolczyk5909 Emacs is nice, but I don't see a reason to use it, so I stick with nvim.

    • @AbhishekBM
      @AbhishekBM 3 года назад

      @@thaddeusolczyk5909 the command line version of Emacs is terrible.

  • @scottkelbell5003
    @scottkelbell5003 4 года назад +5

    Easy to learn, too. I love awk! Thank you Gary!

  • @websciencenl7994
    @websciencenl7994 4 года назад

    Wow, i've just stumbled upon your channel and i'm very very impressed with your teaching skills. Absolutely brilliant. Thank you soooo much for doing this!

  • @SpeccyMan
    @SpeccyMan 4 года назад +46

    Just thought I'd explain, that 'up arrow' is a caret or circumflex.

    • @peppigue
      @peppigue 3 года назад +3

      That's why ppl call it up arrow

    • @hecko-yes
      @hecko-yes 3 года назад +3

      @@peppigue you wouldn't call v a down arrow even though it's used that way sometimes
      < > can be less/greater than symbols or angle brackets or left/right arrows, but in a programming context you'd probably use the former
      ...except when it's a "shift left" operation in which case it'd make sense to call them arrows
      hm
      maybe left v and right v??

    • @randycreasi
      @randycreasi 3 года назад +2

      Sometimes people call it "hat" referring to the hat operator in mathematics.

    • @gorgolyt
      @gorgolyt 3 года назад +6

      Weird 'flex but okay.

    • @jwadaow
      @jwadaow 3 года назад +2

      @@gorgolyt very circumspect

  • @villesyrjala3354
    @villesyrjala3354 4 года назад +8

    I've used awk to extract useful information from pdf documents. The problem was that the information was awkwardly (pun intended) split into several tables throughout the document so I had to first process each table to collect up all the pieces for each element I wanted to output. The solution I came up with was pdttotext + awk to do the processing. The few hours I spent on that awk script has paid off nicely since I've had to reprocess new versions of the same document several times over the years.
    The alternatives would have been:
    a) Manually copy paste all the information. One thing I've learned over the years is you *never* trust anything copy pasted by a human (least of all myself)! Also would have been extremely tedious (which adds to the chance to making a mistake), and I would've had to repeat it whenever a new version of the document came out.
    b) Find some pdf library for my favorite programming language to extract more structured data from the document. Couldn't quickly find anything that worked and I didn't want to start debugging pdf libraries.

    • @PraveenKumar-xq7xb
      @PraveenKumar-xq7xb 4 года назад

      Hi I am interested in pdf extraction. Can you kind of give some clue codes to me to explore further.

    • @mychromebook9935
      @mychromebook9935 3 года назад +1

      Another solution is to use a command line pdf editing tool called, pdfTK. You can read out pdf files from command line and even fill in pdf forms with it.

    • @stevebabiak6997
      @stevebabiak6997 3 года назад

      IIRC, Perl has some modules (think libraries) for handling PDF files, and Excel files too.

    • @CARPB147
      @CARPB147 2 года назад

      Cool.

  • @asandax6
    @asandax6 2 года назад

    I've been just doing this from any programming language I was learning when I get to the read and write files section of the documentation. Nice to see it can be done directly on the command line.

  • @damiancosta2588
    @damiancosta2588 2 года назад +2

    Gary, awesome job giving me the basic understanding of awk. All my little failed projects have been revived since Your walk thru of the AWK!

  • @cheaterman49
    @cheaterman49 3 года назад +10

    Thanks Gary, that was simple and useful. I usually write small Python scripts for such data processing because I'm more fluent in it, but if it were CSVs or tab-delimited outputs (like from other shell programs) awk is just way simpler. Always wished I got some proper simple intro to it, now it's done :-) - perhaps you could make a follow-up episode or two? Thanks again!

  • @ibuprofenPill
    @ibuprofenPill 2 года назад +56

    As a senior software engineer, I understand the advantages of what you’re demonstrating. However, the common person will never use this on a wide basis. But, your heart is in the right place.

    • @johnmcginnis5201
      @johnmcginnis5201 2 года назад +2

      That is indeed a problem. I have overcome it for many tasks for mere mortals by writing awk code for the task then front ending that with zenity. Good enough that when I need to see the division manager she makes sure I am at the top of the list in the morning. :)

    • @CARPB147
      @CARPB147 2 года назад +1

      @@johnmcginnis5201 - I had the same experience. I remember the first time I took a *formatted report*, ran it through AWK, and provided my astonished manager with information she needed selected and totaled in a totally different way in a matter of minutes.

    • @not_herobrine3752
      @not_herobrine3752 Год назад

      yeah the comment section goes "just use python"

    • @syte_y
      @syte_y 6 месяцев назад

      When i was doing a lot of data processing on csv files this would’ve been useful. don’t think it applies to a lot of people like you said though

  • @Chris-op7yt
    @Chris-op7yt 4 года назад +2

    awk is useful as a very short/quick way to get at Nth field in a text file, especially as it treats consecutive delimiters as one. for example, fixed width input files.
    But where awk really shines is multi-levelled line delimited files, like old style config files etc., as you dont need to write loops and keep flags about which section of the input file you're in.

  • @randomsalgari
    @randomsalgari 2 года назад +2

    I really love awk, it was one of the main tools I used for my thesis.

  • @nickharrison3748
    @nickharrison3748 3 года назад +3

    awk -F "/" {print ...} . for specifying the different seperator other than default whitespace

  • @shikhanshu
    @shikhanshu 4 года назад +16

    I use awk at my job and I am always in awe of it. This video is a great little intro and the rounding logic was pretty neat too! Thanks Gary!

    • @timothykeith1367
      @timothykeith1367 3 года назад

      gawk

    • @DeepakKumar-lv4te
      @DeepakKumar-lv4te 2 года назад

      surely adding 0.5 to say 2.1 doesn't give the correct rounding integer as it doesn't round up ?

  • @Ureallydontknow
    @Ureallydontknow 4 года назад

    high quality video. perfect audio. the pace was perfect. the explanation and examples were perfect.

  • @JR-pk1fr
    @JR-pk1fr 3 года назад +1

    I love it! The best awk demo on RUclips. More awk vids in the future???

  • @johng7rwf419
    @johng7rwf419 4 года назад +6

    Good old integer arithmetic, takes me back to when I was a lad..
    Awk would have been very useful in a former life. Thank you very interesting.

  • @basilburgess8415
    @basilburgess8415 3 года назад +7

    I love awk for text formatting and, arguably informal reporting. Also admin scripts. Honestly, though, you can all this and more with PERL, which I recommend.

    • @xrafter
      @xrafter 2 года назад

      Isn't perl a prolang ?
      Prolang = PROgramming LANGuage

    • @CARPB147
      @CARPB147 2 года назад

      Yes! (upvoted you for being spot on)

  • @Rovsau
    @Rovsau 3 года назад +2

    Thanks for the info. Spreadsheets are more practical for me, and more practical for newbie experimentation, but AWK really bridges the gap between sheet logic and optimizing work. Plus, you get support from a lot more scripting capabilities. Great stuff!

  • @mfolarin
    @mfolarin 3 года назад

    Gary! I'm excited to find your channel. Pleasing. Subscribed!

  • @adam872
    @adam872 4 года назад +3

    Love me some AWK and have made plenty of use of it over the years.

  • @apmcd47
    @apmcd47 3 года назад +2

    A long time ago when someone was telling me how wonderful Excel was, I simply said "ed, perl, tbl, troff" as in edit your data using ed (actually, I never use ed), process it with Perl (I don't know awk), and finally format it with troff using the tbl preprocessor.

  • @mikeCavalle
    @mikeCavalle 2 года назад +1

    lol - awk in the late '70's -- life saver then and now.

  • @Mildimage
    @Mildimage 3 года назад +1

    Awesome video man!. This was my first contact with awk command. Very very good explanation!

  • @yjk_ch
    @yjk_ch 3 года назад +4

    I remember using Awk for extracting a column from a command result(using something like {print $1}, but I didn’t know that it could do much more than that.

  • @robbo916
    @robbo916 4 года назад +3

    I won't go into the specifics but AWK holds a special place in my heart. I know that might sound a bit weird but it's true. Even though I've only ever had to use it in anger twice it was well worth learning just for them.

    • @philipcoltharp918
      @philipcoltharp918 2 года назад +1

      Thanks, Im glad I clicked in. I never would have searched out this otherwise. I do use SED, GREP, and GVIM. The next time I have the opportunity, I'll have to try to apply these lessons.

  • @ORagnar
    @ORagnar 3 года назад +2

    He has the smoothest advertising transitions I've come across. Great job! Great content, too!

  • @SteveJones172pilot
    @SteveJones172pilot 2 года назад +2

    Great video! I have always used grep to search strings in linux and never bothered to figure out what awk did.. This was a great introduction - Just what I need so that next time I have a use case I will remember this and figure out how to do it in awk!

  • @watcherquek263
    @watcherquek263 4 года назад +8

    perl >> (sed, awk ). You can do all of sed and awk in perl (and there are even conversion scripts for it, called a2p and s2p) but not the other way.

    • @jrd33
      @jrd33 4 года назад

      Sure. And an 18-wheeler can carry more than a pickup. But a lot of people find a pickup works just fine for day-to-day tasks.

    • @CARPB147
      @CARPB147 2 года назад

      @@jrd33 - Certainly. There are tools more suited for certain jobs than others. I think it is good to have variety of choice.

  • @1MarkKeller
    @1MarkKeller 4 года назад +31

    *GARY!!!*
    *Good Morning Professor!*
    *Good Morning Fellow Classmates!*

    • @GaryExplains
      @GaryExplains  4 года назад +10

      MARK!!!

    • @jamess1787
      @jamess1787 4 года назад +2

      Mark, sit back down and turn to page 33 in the 2020 edition of GE

  • @oxynetz
    @oxynetz 2 года назад

    'Gentle introduction to awk' .... gentle if you're like Gary... Thanks for this

  • @12Q46HPRN
    @12Q46HPRN 3 года назад +1

    Excellent! You explained awk very well!

  • @spaceiswater6539
    @spaceiswater6539 4 года назад +5

    Please can you make some more AWK videos Gary?
    I'm learning AWK at the moment spent a few days on it, its hard to learn but the rewards in knowing how to use it is worth the reward. This is a great video to get people into using it and seeing the power of it.

    • @johnmcginnis5201
      @johnmcginnis5201 3 года назад +1

      1) Learn a middling amount of 'C', K&R please, none of that C++/# crud.
      2) Have a good understanding of regular expressions.
      3) Realize that each line is processed in the order received by the program statements after BEGIN and before END. Process order can be important.

    • @xrafter
      @xrafter 3 года назад

      @@johnmcginnis5201
      Why we need to put the {} between print .
      What does it mean?

  • @oguzaranay
    @oguzaranay 4 года назад +5

    My first time hearing and knowing this language!

    • @CARPB147
      @CARPB147 2 года назад +1

      Oh, oh. The secret is coming out!

  • @marklyttle5449
    @marklyttle5449 2 года назад

    I used to work for a Telecom's company and at one point was involved in integrating our CMIP stack (millions of lines of C/C++ code) with network management solutions from vendors like IBM and HP etc. I remember the DEC solution to get their OSI network event logs into our stack was based around 5000 lines of AWK code. AWK is awesome.

  • @rish1459
    @rish1459 3 года назад +2

    Regex, SED, & AWK are awesome tools!

  • @jzmmm
    @jzmmm 4 года назад +148

    In the early 90s: hey everyone, learn guis!!
    Today: hey let's go back to the command line!!!

    • @Hassan8Ola
      @Hassan8Ola 4 года назад +3

      GUIs are good for repeated, error-prone tasks. If you find yourself doing a task over and over again in which the task never changes, then build a GUI for it. But probably, that it is not likely, since the task can always improve and change. If you can isolate something so well that it can get its own GUI, then go for it. Nowadays, that is not easy to do.

    • @walker9893
      @walker9893 4 года назад +46

      @@Hassan8Ola this seems like the criteria for scripted automation....

    • @xijinpingpong4426
      @xijinpingpong4426 4 года назад +11

      GUI is nice if you need to see some visualized information or for entertainment. Terminal is nice for fast programs that have a specific task and work together with other programs.
      People who refuse one of them (GUI or Terminal) limit them self.

    • @diederick76
      @diederick76 3 года назад +7

      I think the general idea in the 90s was that for GUIs you didn't have to learn anything.

    • @MsHojat
      @MsHojat 3 года назад +3

      There's nothing stopping a program that is GUI that has all the functionality of a command line program, or even having a command line entry area inside of it. The problem is more with the fact that most or practically all GUI programs don't do this for some stupid reason!

  • @harrikoivula3932
    @harrikoivula3932 3 года назад +13

    I used to use GREP, AWK and SED in the 80's while porting a CAD program from on operating system to other. But nowdays I tend to use PERL and many times with excel. You can do many things with excel, but complex data manipulation tasks are much easier with perl. One of the best concepts in data manipulation with PERL and AWK are associative arrays.

  • @olimpiudehelean5885
    @olimpiudehelean5885 Год назад +1

    This has to be the best clarification i've ever seen . Thanks a lot !

  • @MadMetsFan
    @MadMetsFan 4 года назад +2

    I learned a lot of AWK about 20 years ago - very useful

  • @lactobacillusprime
    @lactobacillusprime 4 года назад +53

    Databases! I was using SQL and databases / Dbase for a long time. Can't stand using spreadsheets as databases.

    • @makoto_0
      @makoto_0 4 года назад +9

      Spreadsheet is just for office work, can't do anything for big data (It blows up at the moment you open the file). Database is the real deal.

    • @1MinuteFlipDoc
      @1MinuteFlipDoc 4 года назад +16

      the right tool for the right job!
      Nobody ever: I hate having to commute from LA to NY on a skateboard!

    • @jamess1787
      @jamess1787 4 года назад

      AMEN

    • @kencheng2929
      @kencheng2929 4 года назад +4

      Hi, I’m not a db admin but my feeling is spreadsheets are easier to use and they’re right in front of you. Databases need some kind of ui or they use the cli (inserts, selects). Please confirm/correct.

    • @jamess1787
      @jamess1787 4 года назад +3

      @@kencheng2929 hi Ken, you have a valid point. If you have a small set of data points to keep track of: then spreadsheets make sense. When you start to get into the 1000's+ then it's time to start looking into a database solution.
      Spreadsheets should be more for temporary data that has no long-term value. Like forecasting or basic customer metrics. =)

  • @ricsanders69
    @ricsanders69 4 года назад +7

    Reminds me of my days on the Sparc 2!! :-D Those were the days. *sigh*

  • @greyman1104
    @greyman1104 4 года назад +1

    Haven't understood awk before. Thanks, Gary!

  • @mywebuniversity
    @mywebuniversity Год назад +1

    Great quick demo of awk power.

  • @trueriver1950
    @trueriver1950 4 года назад +3

    awk one liners are great for ad hoc queries and I use it for that, but as soon as you go to scripting surely perl is the way to go?
    Or if you don't already know perl, then maybe Python which is more friendly for beginners?

    • @jrd33
      @jrd33 4 года назад +1

      Awk is great for awk-shaped problems (basically, report generation on files of simply-formatted ASCII data). If you have a different-shaped problem, don't use awk.

  • @adityathapan2544
    @adityathapan2544 4 года назад +6

    Your way of explaining topice is very easy... Please make more videos on linux

  • @aminebouaita9202
    @aminebouaita9202 3 года назад

    Thank you for the crystal clear demo !

  • @dragonsage6909
    @dragonsage6909 2 года назад +2

    Great tutorial, subbed.. thank you!

  • @khatharrmalkavian3306
    @khatharrmalkavian3306 3 года назад +3

    Kay, maybe if - for god knows what reason - I'm writing a super complex bash script.
    Even then, probably not. I generally just sub in a proper scripting language for that.

  • @faaaszoooom6778
    @faaaszoooom6778 3 года назад +7

    I use AWK a lot. And while I'm a C++ dev, I'd still recomment Python as a replacement for excel sheets for quick calculations.
    AWK has severe limitations, which makes it a bit harder to use for anything more complex than basic arithmetics (or string manipulations, but even that is a bit difficult sometimes.)

    • @huistelefoon5375
      @huistelefoon5375 2 года назад

      AWK has been used to create a full parser/tokenizer, and other purposes that are arguably way more complex for its intended use. Using AWK you can also make advantage of pattern matching with regular expressions, and AWK has many other tools for text manipulation. But I think one of the most powerful aspects of AWK is using it as a complimentary Unix tool. Use it together with other Unix command-line utilities with pipelining, not everything has to be done in AWK. For example, you can use AWK to parse out formatted words from a complex text file, and now pipe this data to be processed by another utility.

  • @dmays67
    @dmays67 Месяц назад

    Currently assessing how to extract useful data from multiple differently formatted fuel receipts here. Found your lovely little primer video very helpful - thanks!

  • @traal
    @traal 2 года назад +1

    GNU awk was the first scripting language I learned really well, and I wrote most of my early Bourne shell scripts as basically wrappers around huge chunks of awk code. Then I graduated to Perl, which is absolutely unmatched if you love regexes (I do!), and nowadays I write everything in Python if it's too much for a simple bash script. 😊
    I still use awk and Perl daily for oneliners when I do data wrangling. The awk syntax is super comfortable for the things that it is good at. 👍🏻

  • @hinkhall5291
    @hinkhall5291 2 года назад +7

    Most young devs don’t even know about commands that aren’t cd, ls, mv, cp or docker.

    • @xrafter
      @xrafter 2 года назад

      pacman -S firefox .
      Help this line above doesn't work .
      The error is weird 😞

    • @not_herobrine3752
      @not_herobrine3752 Год назад

      they have heard about the command line?

  • @MH-in8hz
    @MH-in8hz 3 года назад +3

    *Meanwhile*: *Dying in remorse for all the time I've wasted on learning how to use batch files syntax for Windows*
    what makes it even sadder is that I've always wanted to make use of what I've learned from Java especially when it comes to file management, bash scripts look a lot similar to Java, didn't expect Linux os to be this awesome, I've got bored from all the propaganda for Linux os but, now I understand. I'm woken at last 😂
    btw, you did a brilliant job on the rounding function, so satisfying.🤩

    • @0x007A
      @0x007A 2 года назад

      Sullying bash by comparisons to Java is heresy. Wash out your mouth with a bar of soap.

  • @TheLordoftheDarkness
    @TheLordoftheDarkness 3 года назад +1

    Thanks I finally found out how to run an awk script from a file. Also if you start your file with
    #! /usr/bin/awk -f
    and set the file to executable you can run the script with just
    ./script.awk

  • @TheSulross
    @TheSulross 3 года назад +1

    Need to install and use gawk instead of awk, though, as can then use a match function to match with regular expressions and then reference capture groups in the awk print command - this match is way better than just printing things that got separated into fields

  • @piiumlkj6497
    @piiumlkj6497 4 года назад +119

    Let's become terminal ninjas ;) , AWK is very used in linux commands

    • @jpdj2715
      @jpdj2715 3 года назад +1

      What do you expect from something that was developed in the terminal age? First stable release 1985. Developed by Aho, Weinberger and Kernighan (yes the Kernighan who wrote "The C Programming Language" with colleague Ritchie 1st ed. 1978). Or, when Linus scribbled his Ux, awk was already age old.

    • @uhmasyt
      @uhmasyt 3 года назад +2

      awk is slow, too.
      I remember just moving a script from awk to Perl I could reduce processing time by 70% and ended up with something like 30 minutes on a well equipped Unix machine.

    • @piiumlkj6497
      @piiumlkj6497 3 года назад +1

      Why perl when you've got python which has a much better syntax ? The best tool is the one available to you and the one you feel most comfortable using .

    • @lxathu
      @lxathu 3 года назад

      @@uhmasyt Really complex matches that perl can do in one step with its advanced regexps but one must has to write subroutines for can lead to perl's victory but most of the cases the burden that comes with that draws perl back.
      Awk if FAST with the task it was designed for.

    • @CARPB147
      @CARPB147 2 года назад

      @@lxathu - "but one must has to write subroutines for can lead to perl's victory".
      Not always. We can write Perl one-liners about as well as we can write AWK one-liners (that is, an entire script in a single command, without script files, or subroutines). It depends on the problem being solved, of course. Perl is pre-compiled and will run faster than AWK in most (all?) cases, and certainly scales-up more gracefully to solve more complex problems.

  • @jinchoung
    @jinchoung 4 года назад +4

    subscribed. dammit, I'm not a coder (much of) but you made that understandable even while talking blisteringly fast! kudos.

    • @CARPB147
      @CARPB147 2 года назад

      You can slow playback of RUclips videos when clicking on the cogwheel icon and selecting your playback speed.

  • @xtnctr
    @xtnctr 3 года назад +3

    Outstanding explaining skills!

  • @chitalvision
    @chitalvision 4 года назад

    I'm actually enjoying these technical videos of late!

  • @marcoandreknottjung3418
    @marcoandreknottjung3418 4 года назад +34

    Just a question from a newbie:
    what can I do with those informations?
    I come to your channel, via ColdFusion and the graphene battery!
    Thanks in advance

    • @fuseteam
      @fuseteam 4 года назад

      anything you want really ;)

    • @cfbasz
      @cfbasz 4 года назад +11

      You can manipulate information from files and extract what you want in the way you need. It is just pure formated text being manipulated. No spreadsheets needed. Cheers!

    • @hextz
      @hextz 4 года назад +10

      you can redirect the formated output of the awk script to another file, for example: I wanted to create a test file like the one used in the video but didn't know how to do it using only ls, so I used a "ls -la /usr/bin > ls-output.txt", then used awk to select only the fields in the order I wanted with "awk '{print $9,$5} ls-output.txt > ls-awk-output.txt". It's very handy to manipulate formated text files like csv, config files, logs, program outputs, whatever you can imagine...

    • @berndeckenfels
      @berndeckenfels 4 года назад +3

      See 14:47 for example

    • @claudehebert3131
      @claudehebert3131 4 года назад +7

      One example: Extract data from a not very user-friendly system, in a tab-delimited format. Convert it into SQL commands (using loads of "printf"). Run the generated SQL code to load the data in a database. AWK can be the glue between otherwise incompatible systems.

  • @skyler4517
    @skyler4517 3 года назад +13

    Idk, the idea of learning awk has been rattling around in the back of my head for a while, I just don't feel like it's worth the overhead when I could do all this just as easily in Python.

    • @lxathu
      @lxathu 3 года назад +1

      Of course you can, but when it comes to tricky interval pattern matching that "just as easily" can fade away quickly.

    • @MathieuDuponchelle
      @MathieuDuponchelle 3 года назад +3

      @@lxathu import re; ?

    • @gorgolyt
      @gorgolyt 3 года назад

      @@lxathu I don't think you're aware that `pandas` is a thing, and that it has regular expression functionality.

    • @user-ze7tl2dw4i
      @user-ze7tl2dw4i 3 года назад +1

      That's cool and pandas is great, but it doesn't beat efficient command line scripting. That's one of the areas I think perl is actually preferable to python

    • @lxathu
      @lxathu 3 года назад

      @@MathieuDuponchelle I don't mean interval of characters but interval of consecutive records.
      RE is nice, but RE without any ifs and mandatory indents in expressions that match ranges of records because the first one matches the first RE (or any expression) and the second one matches the second is nicer.
      Python can do anything. (G)awk can't but what it can, it can with beautifully short but still understandable codes.

  • @batchrocketproject4720
    @batchrocketproject4720 2 года назад

    Thanks, one of the better awk videos on youtube. I use awk scripts on files containing quite chaotic data that lacks the neat structure of csv and similar files. I feel the many comments here suggesting superiority of python, or even perl (which to some extent I agree with) for parsing file data might change once enlightened. To each his own, but my view is don't knock it til you try it. Being efficient with one scripting language does not preclude the possibility that you could be more proficient with another once mastered, especially one purpose-built to extract, manipulate, and reformat data.

  • @prabhujeeva2228
    @prabhujeeva2228 4 года назад +2

    Thanks Garry!!. It's really interesting