- Видео 21
- Просмотров 2 478
R Govys
Добавлен 2 июн 2023
R Govys is sponsored by the American Statistical Association, the R Consortium, and Data Community DC. It is a gathering of those (professionals, educators, researchers, and more) who are interested in using R (and open-source software) for government applications, and hence for the benefit of the broader public.
R Govys December 2024 Seminar: Some History of Open-Source Software
Speaker Doug Bates has been involved in the development of software for data science for over 50 years and has had the pleasure of being part of the R community since its inception. It wasn't always clear that open source projects like R would be as successful as they have been. He describes his view of the development path and how we have ended up with many high-quality tools for data science. A sometimes overlooked aspect of having many such tools is the desirability of using cross-platform data representations such as Arrow tables for exchange of structured data, which is also discussed. Slides available at dmbates.quarto.pub/
Просмотров: 92
Видео
R Govys Nov 2024: Fast and Accurate Data Integration in Social Sciences with fastLink
Просмотров 5821 день назад
The R fastLink package implements a fast and scalable algorithm for probabilistic record linkage. fastLink includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. fastLink implements methods described in Enamorado, Fifield, and Imai (2019) "Using a Probabilistic Model to Assist Merging of Large-scale Administrative...
R Govys October 2024 Comparing Open-Source Record Linkage Software
Просмотров 482 месяца назад
Comparing Open-Source Record Linkage Software to Existing Approaches: A Case Study Using Census Data by Dakota Arthun. The U.S. Census Bureau has used entity resolution, also referred to as record linkage or deduplication, for the decennial censuses and other programs such as post-enumeration surveys for several decades. In particular, entity resolution was used on names and common characterist...
September 2024 Deploying R in a Secure Environment
Просмотров 532 месяца назад
Much like the superior Betamax, R often gets short shrift when it comes to enterprise tooling. This leaves data scientists to their own means to get a functioning IDE and access to the nearly 20,000 CRAN packages. This talk showcases how we setup up our environment on a TS network, providing the data scientists with the latest tools all while fulfilling security requirements. Given by Jared P. ...
R Govys August 2024 Seminar: Designing Against Bias in Machine Learning and AI
Просмотров 874 месяца назад
"Designing Against Bias in Machine Learning and AI" David J Corliss, PhD (he/him) Principal Data Scientist, Grafham Analytics Here is the abstract: Bias in machine learning algorithms is one of the most important ethical and operational issues in statistical practice today. This presentation describes common sources of bias and how to develop study designs to measure and minimize it. Analysis o...
R Govys July 2024: Splink - Free Software for Probabilistic Record Linkage
Просмотров 3144 месяца назад
Robin Linacre (Ministry of Justice, UK) is one of the developers of Splink, which is free software for probabilistic record linkage. Splink is Python based, which could be called via R. Robin gives an overview of the latest version of Spink.
R Govys June 2024: Predictive Latent-Class Modeling
Просмотров 534 месяца назад
Joe Schafer from the US Census Bureau presents his new package currently named bigLC for fitting single-and multilevel LC models using EM or MCMC. It currently accepts categorical and count variables.
R Govys May 2024 Seminar: Using the censusapi Package
Просмотров 827 месяцев назад
Nora Trow Shaw shows us how to use the censusapi R package to download data for wrangling and analysis.
R Govys Seminar April 2024: R and AWS Lambda
Просмотров 767 месяцев назад
R Govys Seminar April 2024: R and AWS Lambda
R Govys Feb 2024: Generative AI: A Survey of Current Practices, Challenges, and Best Practices
Просмотров 6779 месяцев назад
Presentation at R Govys by Rajiv Shah (Snowflake).
October 2023 R Govys: Two Tools for Creating Graphics in R
Просмотров 6510 месяцев назад
Two Tools for Creating Graphics in R The October seminar will feature two presenters- Wendy Martinez and Jessica Klein from the US Census Bureau. The speakers will give presentations on two tools useful for creating data visualizations. First, Wendy will be discussing the micromapST package (available on CRAN), that implements linked micromaps, specifically for US state data. The package provid...
R Govys Jan 2024: Bringing generative AI to RStudio with GitHub Copilot and chattr
Просмотров 24710 месяцев назад
Presenter: Tom Mock: Product Manager for Posit Workbench and RStudio IDE Webinar 01/18/2024 SPONSORED BY: RGOVYS, THE ASA COMMITTEE ON DATA SCIENCE & AI, AND THE ASA STATISTICAL COMPUTING SECTION Code generating AI tools like GitHub Copilot™ promise an "AI pair programmer that offers autocomplete-style suggestions as you code". For the first time, we'll show a native integration of Copilot into...
R Govys September 2023 Text Analysis Through Time
Просмотров 66Год назад
"KEY ISSUES IN THE HISTORY OF STATISTICS AS SEENTHROUGH TEXT ANALYSIS ON 114 YEARS OF ASAPRESIDENTIAL ADDRESSES" BRANDON KOPP, US Bureau of Labor Statistics IN HER 2020 ASA (American Statistical Association) PRESIDENTIAL ADDRESS, WENDY MARTINEZ DISCUSSED AN R SHINYAPPLICATION THAT DISPLAYED THE RESULTS OF TEXT ANALYSIS ON OVER 110 YEARS OFPREVIOUS ASA PRESIDENTIAL ADDRESSES. IN THIS PRESENTATIO...
March 2023 cvam: An R Package for Modeling Coarsened Categorical Data
Просмотров 28Год назад
cvam: An R Package for Modeling Coarsened Categorical Data Speaker: Joseph L. Schafer United States Census Bureau Coarsened data can express intermediate states of knowledge between fully observed and fully missing. For example, when classifying survey respondents by cigarette smoking behavior as 1=never smoked, 2=former smoker or 3=current smoker, we may encounter some who reported having smok...
February 2023 Intro to text analysis in R
Просмотров 51Год назад
In this session Tomas Drgon will introduce the basics of text mining in R (package TM)… Corpus creation, text prep, Document Term Matrices; and provide a live coding example of analysis of Document Term Matrices employing Principal Component Analysis and Hierarchical Clustering. Tomas has a MSc in Food Biotechnology from Slovak Institute of Technology and PhD in Biochemistry (1994). He did post...
January 2023 ASA Biopharmaceutical Section (BIOP) Software Engineering Working Group (SWE WG)
Просмотров 22Год назад
January 2023 ASA Biopharmaceutical Section (BIOP) Software Engineering Working Group (SWE WG)
August 2023 EPA's Data Management and Analytics Platform
Просмотров 46Год назад
August 2023 EPA's Data Management and Analytics Platform
April 2023 Chat GPT: The utility and risks of large language models for US agencies
Просмотров 19Год назад
April 2023 Chat GPT: The utility and risks of large language models for US agencies
April 2022 - Data Modeling at the US Congressional Budget Office and Stat Canada
Просмотров 15Год назад
April 2022 - Data Modeling at the US Congressional Budget Office and Stat Canada
May 2023 Learning to Use the tidycensus Package
Просмотров 357Год назад
May 2023 Learning to Use the tidycensus Package
would have been great if all these codes were in the description
Thanks so much for such a useful tutorial. It allowed me to start coding with AI in RStudio and now things are much easier