Find Duplicate Files In Linux With Awk In Under A Minute!
- Published: 7 Nov 2024
- AWK is powerful and can be your friend in Linux. Here we show how we can use awk to detect duplicate files in Linux. Taking the output from md5sum, we can see duplicate content. Passing that to awk, we can build arrays counting each entry:
md5sum *
md5sum * | awk '{ count[$1]++ }'
md5sum * | awk '{ count[$1]++ } END { for (k in count) print count[k] }'
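Not from the video itself, but a minimal sketch of where those commands lead, assuming the files sit in one directory and have no spaces in their names: remember which filenames share a checksum and print only the checksums that occur more than once.
```bash
# Tally each checksum, collect the filenames that produced it,
# and print only the checksums seen more than once
md5sum * | awk '{ count[$1]++; names[$1] = names[$1] " " $2 }
  END { for (k in count) if (count[k] > 1) print k ":" names[k] }'
```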
Additionally, you can find my video courses on Pluralsight: pluralsight.com... and take time to see my own site: www.theurbanpen...
~-~~-~~~-~~-~
Please watch: "RHCSA 9 Working With Podman Containers"
• How To Use Podman Cont...
~-~~-~~~-~~-~
Wow, GREAT tutorial. I used 'rdfind' for finding duplicate files within Linux. But not having to install additional software is a huge advantage for me. Thanks!
Thanks, did not know of rdfind
I'm gonna be honest with you. You make some of the best videos on the Linux OS. Your direction and explanation of bash are absolutely awesome. I hope you can make videos more frequently.
Great video. I always wanted to learn awk and this video was an excellent start!
Thank you
Thanks a lot for all this valuable information you give us!
:)
just thank you, I owe a lot to you and your training videos
thank you, and congratulate yourself for your own effort in learning
*When are you going to upload new vids?*
Hello, how do you prepare for the LPIC-3 if there is currently no book with updated content?
No, thanks - I'll just use python :D
I love the syntax of bash when you need to do anything even slightly more complex than listing files.
In PowerShell, if you slam your head against the keyboard repeatedly, you'll get syntax errors.
If you do the same in bash, you'll get a Kubernetes cluster, an upgraded kernel, and Arch installed on a separate partition. :D
Troubleshooting a fubar'd system would be interesting, or perhaps something instructional like Distributed SSH (DSH), or even a video covering building a kernel for a system in 2023.
Thanks again for another nice video!
Thank you
I have a question: is there a danger in just deleting the duplicated file? As I understand it, a message digest algorithm like the MD5 used in this video can generate the same result even if the source input is different 😂. Maybe I am overthinking it. 😅
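If collisions are a worry, one way to be safe (a minimal sketch, not from the video; the .iso filenames are hypothetical) is to confirm a suspected pair byte for byte with cmp before deleting, or to hash with sha256sum instead of md5sum:
```bash
# Compare two suspected duplicates byte for byte; cmp exits 0 only if they are identical
cmp --silent file1.iso file2.iso && echo "identical" || echo "different"

# Or use SHA-256, where accidental collisions are practically impossible
sha256sum * | awk '{ count[$1]++ } END { for (k in count) print count[k] }'
```
In practice, MD5 collisions between ordinary files on your own disk are vanishingly unlikely unless someone crafted them deliberately, but a byte-for-byte check costs little.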
Is it possible to improve this command to have it look through directories recursively? As far as I can see md5sum does not have a -r or --recursive option.
find . -type f | xargs md5sum | awk '{ count[$1]++; name[$1] = name[$1] " " $2 } END { for (k in count) if (count[k] > 1) print name[k] }' | sort
I would recommend using a variation on find(1) to get a listing of files recursively.
```bash
# Find all files (not directories) under the current working directory and run md5sum on each
find . -type f | xargs md5sum
```
Alternatively, you could use "actions", a feature supported directly by find (see man find), to eliminate the need for xargs in the previous pipeline. That looks something like this:
```bash
# Let find run md5sum itself; -execdir executes the command from each file's own directory
find . -type f -execdir md5sum "{}" \;
```
Another interesting method is this one, which generates an md5sum command for each file path and runs them by piping into bash:
```bash
# Build an "md5sum <path>" command line for each file and pipe the commands into bash
find . -type f | awk '{print "md5sum " $1}' | bash
```
Note that these options might not handle cases like spaces in filenames and that kind of thing. Flags like -print0 for the find command and -0 for xargs might come in handy in those cases.
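For completeness, a minimal sketch of the null-delimited variant those flags enable, which keeps filenames containing spaces intact through the pipeline:
```bash
# find emits each path terminated by a null byte and xargs splits on it,
# so paths with spaces or newlines reach md5sum unmangled
find . -type f -print0 | xargs -0 md5sum
```
The awk stage from the earlier replies still splits on whitespace, so filenames with spaces would need extra handling there, but the checksums themselves come out correct.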
Sir, I have over 50,000 dups on my laptop. Looking at your vid and thinking of asking for your help, thanks.
Thank you.
thanks
Czkawka