@Thiyangt


Merry Christmas! Generated using R❤️

🦠🚀 Empowering #Dengue research through data! The Dengue Data Hub, an initiative by the R Consortium, is revolutionizing access to dengue-related data. Learn how researchers can easily use this resource with the denguedatahub package, Shiny app, and website.

🔗https://r-consortium.org/posts/empowering-dengue-research-through-the-dengue-data-hub/

#rstats #openscience #publichealth #DataHub #Dengue

Empowering Dengue Research Through the Dengue Data Hub: R Consortium Funded Initiative – R Consortium

The Dengue Data Hub, an ambitious initiative funded by the R Consortium ISC, transforms how researchers access and utilize dengue-related data.

Inspired by the Monash University NUMBATS research group, I’ve created all my course websites for the semester using Quarto! Open science not only shares knowledge but also sparks inspiration. Thank you, NUMBATS. #rstats #quarto #openscience

I made a small #rstats package for embedding an interactive directory listing as an HTML widget 📂

Great for demonstrating or discussing folder structure in class or otherwise!

github.com/emitanaka/dir

Check out my package tsdataleaks, which detects data leaks (cases where the training data contains information about the test data) in forecasting competitions. Paper: https://arxiv.org/abs/2402.10522, Package: https://github.com/thiyangt/tsdataleaks #Rstat #timeseries #forecasting #DataScience
tsdataleaks: An R Package to Detect Potential Data Leaks in Forecasting Competitions

Forecasting competitions are of increasing importance as a means to learn best practices and gain knowledge. Data leakage is one of the most common issues that can often be found in competitions. Data leaks can happen when the training data contains information about the test data. There are a variety of different ways that data leaks can occur with time series data. For example: i) randomly chosen blocks of time series are concatenated to form a new time series; ii) scale-shifts; iii) repeating patterns in time series; iv) white noise is added to the original time series to form a new time series, etc. This work introduces a novel tool to detect these data leaks. The tsdataleaks package provides a simple and computationally efficient algorithm to exploit data leaks in time series data. This paper demonstrates the package design and its power to detect data leakages with an application to forecasting competition data.

arXiv.org
DSjobtracker 2.0.0 is now on CRAN! It compiles data from 1172 job ads in data science and statistics, making it an invaluable tool for learning key skills in these areas. Here is a word cloud of 97 requested skills. More info: https://github.com/thiyangt/DSjobtracker #rstats #Datascience
GitHub - thiyangt/DSjobtracker: What skills and qualifications are required for a data scientist?


GitHub
Over 100 participants joined the RLadies Colombo meetup, exploring the transformative journey "From Zero Lines to Endless Code: Navigating the Non-Coder to Coder." Stay tuned for future events by keeping an eye on our meetup page https://www.meetup.com/rladies-colombo/ #rladies
R-Ladies Colombo | Meetup

This is a local chapter of R-Ladies Global (https://www.rladies.org), an organization that promotes gender diversity in the R community worldwide. We meet up in person or virtually to learn about the R programming language, algorithms and advanced tools. R-Ladies welcomes members of all R proficiency

Meetup