I just went through all my code and switched to the stringi version of it! stringr has str_squish(), which I love; it can be replaced with stri_replace_all_regex(x, "\\s+", " ") and stri_trim(), and it's still much faster. Thank god also for the wonderful GPT engines, which really help make this transition once you provide the stringr version of the code!!
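A minimal sketch of the equivalence described above (the example string is made up):

```r
library(stringr)
library(stringi)

x <- "  too   much\t\twhitespace  "

# stringr version
str_squish(x)
#> [1] "too much whitespace"

# stringi equivalent: collapse runs of whitespace, then trim both ends
stri_trim_both(stri_replace_all_regex(x, "\\s+", " "))
#> [1] "too much whitespace"
```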
Hey, I found that tidyr's replace_na() is actually better, and also that for this use case, stringr was even faster than stringi. Results for 100 evals, in microseconds:
tidyr median: 1617.70
stringr median: 7963.00
stringi median: 8015.65
base median: 1590.65
For tidyr, stringr, and stringi I used mutate(x, lang = {package}::{function}(lang, "en"))
For base I used x$lang[is.na(x$lang)] <- "en"
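A sketch of how that comparison could be set up, assuming a data frame x with a character column lang containing NAs (the data and column name are assumptions, not the commenter's actual setup):

```r
library(dplyr)
library(microbenchmark)

x <- tibble::tibble(lang = sample(c("fr", "de", NA), 1e4, replace = TRUE))

microbenchmark(
  tidyr   = mutate(x, lang = tidyr::replace_na(lang, "en")),
  stringr = mutate(x, lang = stringr::str_replace_na(lang, "en")),
  stringi = mutate(x, lang = stringi::stri_replace_na(lang, "en")),
  base    = {y <- x; y$lang[is.na(y$lang)] <- "en"},
  times   = 100
)
```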
Wow, I had no idea stringi was so much more performant. I actually like its syntax for some cases. Will definitely move to using it.
same here! thanks for watching 🤓
I learn a lot from each of your videos... thank you
my pleasure - thanks for watching!
I have followed a few of your videos and indeed I have learned a lot even though I do not know anything about DNA analysis!
phew! 🤓 thanks so much for watching
I think base R is slower because of issues with string encodings. I found, for example, that basename() and dirname() are much slower on Windows than on Mac, and the R developer who fixed this (for an upcoming R version) noted it had to do with encodings.
I use those functions frequently on data frame columns (e.g., FilePath) where most of the column is repeated, so I created a lightweight package, `deduped`, that speeds up running a function on a vector with lots of duplication (see the sketch below). It might help speed up your case too (though it looks like your inputs are all unique, so maybe not?).
Thanks for sharing your performance development journey!
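The core idea behind that kind of speedup is to apply the expensive function only to the unique values, then map the results back onto the full vector. A minimal sketch of the trick (the underlying idea only, not necessarily the deduped package's actual API):

```r
# run a vectorized function on unique values only, then expand back
dedup_apply <- function(f, x, ...) {
  ux <- unique(x)
  f(ux, ...)[match(x, ux)]
}

paths <- rep(c("/data/run1/sample.fastq", "/data/run2/sample.fastq"), 5e5)
system.time(basename(paths))               # runs on all 1e6 elements
system.time(dedup_apply(basename, paths))  # runs on only 2 unique values
```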
Interesting - thanks for tuning in!
Thanks for this series. It's been very helpful to watch the whole process. I would have been interested to see how much each change contributed to the overall performance boost of building the database. I'm guessing removing the for loop provided 90% of the decrease in time.
Also, have you looked at the targets package for pipeline management? I got started with Snakemake, but then found targets, which is incredibly feature-rich (e.g., easy parallelization and batching), made specifically for R, and has great documentation and a helpful, active developer. Might be worth a look for yourself or another series.
Thanks Ted! If I had to guess, it was the substr/substring step; I think this might be what you're referring to. I'll have to check out targets. I like Snakemake because I'm often using numerous non-R tools and I like to have a single system for doing everything.
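For reference, a minimal sketch of what a targets pipeline looks like; the file name and the clean_data()/summarize_data() helpers are hypothetical placeholders, not from the video:

```r
# _targets.R
library(targets)
tar_source()  # load helper functions from the R/ directory

list(
  tar_target(raw_file, "data/raw.tsv", format = "file"),  # track the input file
  tar_target(raw, read.delim(raw_file)),
  tar_target(cleaned, clean_data(raw)),         # hypothetical helper
  tar_target(summary, summarize_data(cleaned))  # hypothetical helper
)
```

Running targets::tar_make() then rebuilds only the targets whose upstream dependencies have changed.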
There is always an issue with the sound.
hrmmm, what's the issue?
@@Riffomonas I think we can hear the sound of your computer. It has happened in multiple episodes.
@@JordiRosell thanks. Is it my typing on the keyboard or a fan? Anyway, if you have a timestamp where it happens, I'd be happy to see what I can do.
@@Riffomonas The fan, from 0:00 on.