For everybody having a hard time with parentheses like Pat has @13:00 Tools -> "Global options "-> "Code" -> On the top to "Display" and then tick Rainbow parentheses
Your tutorials are great. I have a purely wet bio background and your videos helped me kickstart my computational biology literacy. Thank you for openly sharing your knowledge.
This tutorial in practice is very interesting. I did manage to run the entire code but using my local latitude and longitude as you suggested. I did work. My interested variables were TMAX and PRCP. In Rwanda we do not have SNOW. Thanks a lot.
This is great! I like how you build it up and have a specific goal in mind. This is also a problem any of us can tackle since the data is readily available. I typically write my own code for these sort of exercise (since I at least I can understand my own code) - that is how I learn best. I came up with a slightly different way of finding my "closet" weather station. I wrote a couple functions to do this - and tested the distance on Houston-Chicago and got pretty close. Here is how I tackled the problem. I set up two functions to run inside tidyverse - so used rlang (hence the enquo() and the bang bang !!). The first function converts to radians: radians_func % distinct(station) %>% pull(station) My closest station was about 500 m form my current location but has only operated for a couple of years. The filter gave me another station about 4 km away with a more extensive record. I decided to filter for stations with over 100 year record (although it is not clear what kind of record that is). It seems like the search should be more focused, though. What are we after? Temperature it seems. And it seems like that is the one variable most often measured.
Yeah I’ve thought about this but I’d probably build all the plots in the backend using a cron job or something. Then serve them up with minimal JavaScript. I don’t think the overhead of shiny would really be necessary 🤷♂️
I must add the vroom read it in fast (lazy loading I suspect) but I not so sure about the column allocations. It seems to have created new ones with mixed type data. So be aware.
I might be wrong but mehh, I'm just gonna make this assumption. Science in a nutshell 😅 Great tutorial sir. I always enjoy your videos since I learn so much more than what I came for (might you elaborate about top_n ? Couldn't quite grasp this one)
Thanks for the question! top_n returns the n rows (plus ties) for a particular variable that have the highest value. If you give it a negative number you’ll get the smallest values. There’s also slice_min and slice_max which are a bit similar
Pat, Webscraping has - at least in my mind - a different meaning that what you are doing here. It uses rvest etc. It might be misleading for those looking for actual webscraping.
For everybody having a hard time with parentheses like Pat has @13:00
Tools -> "Global options "-> "Code" -> On the top to "Display" and then tick Rainbow parentheses
You don’t like my “see if we get an error message”? 😂
Not at all! I'm loving it! But beginners often struggle with this stuff!
Cheers
@@sven9r 🤣
It's a good hint. But, interestingly, i did not have this option.
Thanks a tons Sir. I am in Germany and i was able to get the lattitude and longitude for my place. This is so incredible .
Wonderful - that's great! 🤓
Your tutorials are great. I have a purely wet bio background and your videos helped me kickstart my computational biology literacy. Thank you for openly sharing your knowledge.
My pleasure! Thanks for watching David 🤓
This tutorial in practice is very interesting. I did manage to run the entire code but using my local latitude and longitude as you suggested. I did work. My interested variables were TMAX and PRCP. In Rwanda we do not have SNOW. Thanks a lot.
Wonderful! I'm glad to hear you got it working. Sorry that you all miss out on snow 😂
This is my favorite video of yours. It is so useful for what I want to do. Thanks!
That’s awesome to hear! What part do you find most useful?
This is great! I like how you build it up and have a specific goal in mind. This is also a problem any of us can tackle since the data is readily available.
I typically write my own code for these sort of exercise (since I at least I can understand my own code) - that is how I learn best. I came up with a slightly different way of finding my "closet" weather station. I wrote a couple functions to do this - and tested the distance on Houston-Chicago and got pretty close. Here is how I tackled the problem.
I set up two functions to run inside tidyverse - so used rlang (hence the enquo() and the bang bang !!).
The first function converts to radians:
radians_func %
distinct(station) %>%
pull(station)
My closest station was about 500 m form my current location but has only operated for a couple of years. The filter gave me another station about 4 km away with a more extensive record. I decided to filter for stations with over 100 year record (although it is not clear what kind of record that is).
It seems like the search should be more focused, though. What are we after? Temperature it seems. And it seems like that is the one variable most often measured.
Well done!
+ looks like a fun assignment to create a shiny dashboard containing time series plots of this data
Yeah I’ve thought about this but I’d probably build all the plots in the backend using a cron job or something. Then serve them up with minimal JavaScript. I don’t think the overhead of shiny would really be necessary 🤷♂️
Great episode as always! I just ended a course about german raster data with some students :) !
Awesome! As always thanks for watching 🤓
Excellent! There's one station in my city!
Wonderful Victor!
I love this, use the rainbow parentheses btw!!
Hah! I try to stick close to the defaults so beginners don't get too freaked out when they see something that looks different from their computer
I must add the vroom read it in fast (lazy loading I suspect) but I not so sure about the column allocations. It seems to have created new ones with mixed type data. So be aware.
Some times the simpler packages are good enough
I'm having issues finding the same website as shown in 1:45 and beyond. Any info on how the path has changed from a year ago?
I just checked it and everything was working. Perhaps the site was down when you tried.
I had no trouble pulling up data for my best neighborhood station. However, my question is the temperature - what is the unit? Kelvin?
I think that was a question that is flashed in the last 5 min or so of the episode. I’ll definitely cover it in tomorrows episode
Pat,
I used vroom to read in the file and it read it fast and detected the columns. The only thing I had to do was to clean the column names.
Great - I haven’t tried vroom yet
I might be wrong but mehh, I'm just gonna make this assumption.
Science in a nutshell 😅
Great tutorial sir. I always enjoy your videos since I learn so much more than what I came for (might you elaborate about top_n ? Couldn't quite grasp this one)
Thanks for the question! top_n returns the n rows (plus ties) for a particular variable that have the highest value. If you give it a negative number you’ll get the smallest values. There’s also slice_min and slice_max which are a bit similar
TMAX looks very high, is that combining rows?
answered my question - TMAX = Maximum temperature (tenths of degrees C)
Good sleuthing! I’ll fix this and the precipitation in the next video 🤓
where is this script, please?
Pat,
Webscraping has - at least in my mind - a different meaning that what you are doing here. It uses rvest etc. It might be misleading for those looking for actual webscraping.
🤷♂️I’m getting data from a website. It’s a form of webscraping
Must be F with errant readings...
Nope. Stay tuned…