-
Yasuto NAKANO (Kwansei Gakuin University)07/07/2026, 17:00Poster
The purpose of this talk is to present md2qstn, a specialized R library developed to bridge the gap between plain-text survey drafting and digital deployment. md2qstn enables the conversion of Markdown-formatted questionnaires into DDI(Data Documentation Initiative)-compliant XML and Qualtrics-compatible QSF(Qualtrics Survey Format) JSON files. Although the prevailing approach in...
Go to contribution page -
Isaac Gravestock (Roche)07/07/2026, 17:00Poster
Google slides is a widely available productivity tool used by many institutions but is not well integrated with existing R ecosystem workflows such as Rmarkdown, which has made its use incompatible with reproducible research and reporting. ladder is an R package for inserting tables into Slides presentations and supports multiple table formats from R.
In particular it supports flextable...
Go to contribution page -
Prof. Richard Shefferson (University of Tokyo)07/07/2026, 17:00Poster
Adaptive dynamics focuses on assessing the role of natural selection in trait evolution and speciation. Matrix community models allow population matrix models to project together via aggregated density dependence. I present a new R package, adapt3, that develops community matrix projection and adaptive dynamics using matrix approaches, with the core kernels all programmed in C++. In adaptive...
Go to contribution page -
H. Sherry Zhang (University of Texas at Austin)07/07/2026, 17:00Poster
Decision choices, such as those made when building regression models, and their rationale are essential for interpreting results and understanding uncertainty in an analysis. However, these decisions are rarely studied because tracing every alternatives considered by authors is often impractical, and reworking a completed analysis is generally of limited interest. Consequently, researchers...
Go to contribution page -
shristi y (IIIT UNA)07/07/2026, 17:00Poster
The widespread adoption of digital music streaming platforms has created unprecedented opportunities to analyze large-scale music consumption data. This study investigates global music trends by analyzing Spotify track data using statistical and visualization techniques implemented in R. The objective is to explore how various audio features—including danceability, energy, valence,...
Go to contribution page -
Florian Sihler (Ulm University)07/07/2026, 17:00Poster
In the past months, we built a tool to analyze all versions (roughly 170,000) of all packages available on CRAN, obtaining around 80 GB of raw data on various semantic aspects such as call graphs of functions, dead code, values of constants, the coverage of provided vignettes, transitive dependencies of packages, and much more. Moreover, the data is linked to the release date and...
Go to contribution page -
Claudiu Forgaci (Delft University of Technology)07/07/2026, 17:00Poster
Spatially designing and planning urban transformations around rivers while capturing the complexities of riverside urban areas remains challenging. An essential part of the challenge is how boundaries are drawn in the analysis of urban areas surrounding rivers. To overcome this challenge, we developed the
Go to contribution pagercrispopen-source R package to automate the morphological delineation of riverside... -
Claudiu Forgaci (Delft University of Technology)07/07/2026, 17:00Poster
The Spatial Data Science across Languages (SDSL) Community brings together developers and users of common and emerging programming languages for spatial data science. It aims to foster understanding and address common issues while discussing language-specific problems. We focus broadly on geospatial and geographic space, with some applications to general image spaces and local reference...
Go to contribution page -
Daisuke Ichikawa (Kibaroku), Koji Makiyama (HOXO-M Inc.), Shinichi Takayanagi, kazuyuki sano07/07/2026, 17:00Poster
Online A/B tests often randomize at the user level while evaluating ratio metrics at a finer-grained unit, such as page views or sessions. This mismatch induces within-user correlation and can make standard Z-tests anti-conservative, increasing false positives. The deltatest package provides an R interface for delta-method-based hypothesis testing of ratio metrics, following the practical...
Go to contribution page -
174. Flexible Aggregation with SUOWA Operators in R. An Implementation Based on the Choquet IntegralTeresa Gonzalez-Arteaga (Universidad de Valladolod)07/07/2026, 17:00Poster
Aggregation functions play a central role in decision making, and among them, weighted means and Ordered Weighted Averaging (OWA) operators are two of the most widely used families. Their relevance is reinforced by the fact that both can be expressed as particular cases of the Choquet integral, which has inspired numerous attempts to develop unified generalizations of these operators.
Go to contribution page
... -
Mauro Loprete (Universidad de la República, Uruguay)07/07/2026, 17:00Poster
Household survey microdata is a primary input for social science research and public policy evaluation, yet the processing pipelines that turn raw microdata into publishable estimates are rarely documented, shared, or reproduced. Each research team writes ad hoc scripts to recode variables, construct indicators, and compute weighted statistics, duplicating effort and introducing silent...
Go to contribution page -
Dr Nicholas Spyrison (IFF (International Flavors and Fragrances))07/07/2026, 17:00Poster
Industrial microbial production systems generate rich process data, yet translating these data into actionable parameter recommendations remains challenging. In this talk, we present a model based framework for generating and interpreting recommendations to optimize microbial production parameters using supervised machine learning.
Using two distinct industrial probiotic strains (GG and...
Go to contribution page -
Ernest Guevarra (nutriverse)07/07/2026, 17:00Poster
nutriverse is an open source project, a collective, and a community of practice. nutriverse is an open source project developing robust, well-tested, and performant R packages for nutrition data analysis. The goal is to provide reliable tools that support the full lifecycle of nutrition analytics, from data ingestion and cleaning to statistical analysis, modelling, and reproducible...
Go to contribution page -
Ward Langeraert (Research Institute for Nature and Forest)07/07/2026, 17:00Analysis best practices and workflowsPoster
Scaling research software beyond single scripts or standalone packages requires deliberate architectural choices, shared conventions, and robust distribution infrastructure. This poster presents the b3verse, a coordinated ecosystem of twelve interoperable R packages designed to transform large biodiversity occurrence cubes into standardized indicators for research and policy...
Go to contribution page -
Dr Oscar de Leon (Universidad del Valle de Guatemala)07/07/2026, 17:00Poster
Air pollution exposure research relies on a growing diversity of wearable personal exposure monitors (PEMs), each producing log files with distinct header structures, column naming conventions, and measurement units. The R ecosystem already offers strong infrastructure at adjacent layers for network-level data (
Go to contribution pageopenairandAirSensor), on-road vehicle emission systems (pems.utils), and... -
Mr Marc Becker (Ludwig-Maximilians-Universität München)07/07/2026, 17:00Poster
We present rush, an R package for asynchronous and decentralized optimization. Traditional approaches for parallel computing in R follow a controller-worker model where a central process proposes tasks, dispatches them to workers, and collects results. When proposing new tasks is computationally expensive, the central controller becomes a bottleneck that leaves workers idle, a problem that...
Go to contribution page -
Vihan Singh (Indian Institute of Information Technology Una)07/07/2026, 17:00Poster
The increasing availability of structured sports datasets has created new opportunities for applying statistical analysis and predictive modeling techniques to sports analytics. The Indian Premier League (IPL) provides detailed match and ball-by-ball datasets that allow in-depth statistical exploration of match dynamics and performance patterns. This study applies statistical analysis and...
Go to contribution page -
Vedansh Bansal (Indian Institute of Information Technology Una)07/07/2026, 17:00Poster
Student performance analysis is an important area in educational data science. This project focuses on building a Student Performance Analyzer to study and evaluate academic performance using collected data such as marks, attendance, and study hours. The objective is to use statistical analysis and data visualization techniques to understand patterns in student performance and support better...
Go to contribution page -
Mohamed El Fodil Ihaddaden (HDI GLOBAL SE)07/07/2026, 17:00Poster
Large Language Models (LLMs) introduce a fundamental challenge for software engineering in R: their non-deterministic behavior makes traditional unit testing inadequate. While identical prompts may yield slightly different outputs, robust validation of model behavior remains essential for production systems, research pipelines, and agent-based workflows.
In this talk, I introduce mini007,...
Go to contribution page -
Claudiu Forgaci (Delft University of Technology)07/07/2026, 17:00Poster
The Rbanism community aims to empower urbanism researchers, students, educators and practitioners to use open-source software and related open-science practices effectively and with confidence. It raises awareness, stimulates engagement and builds capacity by demonstrating the benefits of reproducibility, automation and scalability. Rbanism was initiated in 2021 by a group of R users in the...
Go to contribution page -
Hanna Meyer (University of Münster)07/07/2026, 17:00Poster
One key task in environmental science is the continuous mapping of environmental variables across space, and often across both space and time. Machine learning algorithms are frequently employed for this purpose, combining local field observations with comprehensive sets of predictor variables to produce spatial predictions. This enables the prediction of the variable of interest at locations...
Go to contribution page -
Victor Yu (Hertfordshire County Council, UK)08/07/2026, 16:00Poster
This package allows the user to perform interrupted time series (ITS) with a control across successive interventions (up to 3). This code is based on a prior analysis done at our county where we compared the effect of two successive behavioural interventions designed in improving the uptake of a COVID-19 booster intervention programme amongst immunosuppressed patients at several primary care...
Go to contribution page -
Dr Filip Křikava (Czech Technical University in Prague)08/07/2026, 16:00Poster
Dynamic programming languages are increasingly adopting explicit type annotations. Not only do they serve as documentation, but they also enable static type checking to eliminate entire classes of bugs and help tools provide a better development experience. In this talk, we will present our advancements in bringing types to R, including a type system with a static type checker with type...
Go to contribution page -
Anurag Yadav08/07/2026, 16:00Poster
Artificial satellites play a critical role in modern communication, navigation, weather monitoring, and scientific research. Over the past decades, the number of satellites orbiting Earth has increased rapidly due to advancements in space technology and the growth of commercial satellite constellations. Understanding the distribution of satellites and their potential impacts is important for...
Go to contribution page -
Patryk Kołbyko (Szkoła Doktorska Nauk Społecznych UMCS. Uniwersytet Marii Curie-Skłodowskiej w Lublinie)08/07/2026, 16:00Poster
This study presents an end-to-end R-based workflow for estimating Poland’s natural rate of interest within a Bayesian vector error-correction setting. The empirical objective is to recover an equilibrium real interest rate and the associated monetary policy stance gap, whereas the methodological contribution lies in demonstrating how advanced macroeconometric inference can be structured,...
Go to contribution page -
Ozancan Ozdemir (University of Groningen)08/07/2026, 16:00Poster
The increasing complexity of financial markets demands analytical tools that combine real-time data access, rigorous statistical modelling, and intuitive visual communication within a single, reproducible framework. This study presents FinDash Pro, a production-grade interactive dashboard developed entirely in R using the Shiny ecosystem, designed to bridge the gap between...
Go to contribution page -
Serra İlayda Yerlitaş Taştan (Department of Biostatistics, Erciyes University, Faculty of Medicine, 38030, Kayseri, Türkiye)08/07/2026, 16:00Poster
Accurate diagnosis often requires the integration of multiple biomarkers rather than relying on a single test. However, existing tools for combining diagnostic tests are limited in methodological diversity and usability, especially for clinicians without programming expertise. To address this gap, we present dtComb-Shiny, a user-friendly web-based interface built on the dtComb R package. The...
Go to contribution page -
Ms Daphne Grasselly (Roche), Magdalena Krochmal (Roche)08/07/2026, 16:00Poster
Medical Data Review (MDR) in clinical trials requires study teams to examine patient-level data across dozens of CRF domains — adverse events, labs, vitals, ECGs, and more. Traditionally, this relies mainly on static listings generated per study, requiring extensive setup and line-by-line inspection. We present an R framework, built on teal, that replaces this workflow with interactive,...
Go to contribution page -
Karolina Widzisz (Department of Computer Graphics, Vision and Digital Systems, Silesian University of Technology, Gliwice, Poland)08/07/2026, 16:00Poster
We present a synthetic data generator for simulation studies in clustering and partition comparison. The generator creates datasets with controlled cluster structures and predefined similarity levels between alternative partitions, enabling systematic analysis of clustering algorithms' stability.
The framework uses a Gaussian mixture distribution and generates data through a three-stage...
Go to contribution page -
Laure Cougnaud (Open Analytics NV)08/07/2026, 16:00Poster
The use of R packages in a regulated environment as in pharmaceutical companies might require a formal validation of the R package.
The Validation Hub introduces best practices and insights from pharmaceutical industries for the validation of R packages for use within the biopharmaceutical regulatory setting.
We will contribute to this effort by presenting a git-based workflow to...
Go to contribution page -
Marcin Dubel (Appsilon)08/07/2026, 16:00Poster
Building exploratory analysis dashboards for clinical trials requires considerable expertise, extensive time, and deep familiarity with specialized frameworks. In this talk, we share our GenAI solution to significantly streamline this process. We will present a tool, powered by Claude Code, that enables biostatisticians and clinical researchers to effortlessly create and immediately preview...
Go to contribution page -
Adam Forys (Roche), Magdalena Krochmal (Roche)08/07/2026, 16:00Poster
AI code assistants such as Claude Code, opencode, and Aider can read, write, and run code. However, they work separately from the user's R session. They cannot look at live objects, call R functions, or update a running Shiny application. We present a way to connect these assistants directly to R and Shiny using the Model Context Protocol (MCP).
The main idea is to use CLI-based AI agents...
Go to contribution page -
Winkle Lu08/07/2026, 16:00Poster
Clinical trial data analysis typically focuses on specific analysis datasets, but the complete journey of individual patients — from screening, enrollment, and first dose, through visit records and adverse events, to last dose and survival status — represents critical time-based data points that reviewers prioritize. This fragmentation of information forces reviewers to switch between multiple...
Go to contribution page -
Angelika Meraner (Statistics Austria)08/07/2026, 16:00Poster
persephone3 is the updated R framework developed at Statistics Austria to enable efficient processing of large sets of time series in the production of seasonally adjusted estimates. It modernizes the original [persephone][1] package by moving from the RJDemetra backend to the new [rjd3 ecosystem][2], ensuring long term maintainability and compatibility with current JDemetra+...
Go to contribution page -
Dr Jan Simson (LMU Munich)08/07/2026, 16:00Poster
We present peRsian, an R package containing color palettes based on handcrafted Persian carpets for use in data visualization. peRsian is a tribute to centuries of Persian carpet-making, a craft that’s been alive for over two thousand years. It’s dedicated to the incredible artisans who’ve kept this tradition alive: especially the women who spent countless hours knotting and weaving every...
Go to contribution page -
Khanh Do (Deakin University), Vedanti Padhye (Monash University)08/07/2026, 16:00Poster
While R is famous for its usage as a statistics tool, it also has the potential to serve as a practical tool for day-to-day corporate operations. To unlock this practical value, it often relies on the ability to deliver complex mathematical backends through intuitive interfaces. In this presentation, we will explore the core principles of building enterprise-ready applications by showcasing a...
Go to contribution page -
Yuki Yanai (Kochi University of Technology)08/07/2026, 16:00Poster
We have developed rgamer, an R package for learning and applying game theory. The goal of rgamer is to support both teaching and learning by enabling students to explore game-theoretic concepts and instructors to demonstrate them effectively in R. The package not only solves standard models such as two-person normal-form games, but also provides visualizations that highlight key structural...
Go to contribution page -
Joanna Zyla (Department of Data Science and Engineering, Silesian University of Technology, Gliwice, Poland)08/07/2026, 16:00Poster
Gaussian Mixture Modeling (GMM) is a one of unsupervised techniques used in many fields of data analysis, such as bioinformatics, pattern recognition, and network traffic analysis. Yet, existing R implementations often lack support for binned data (commonly observed in image analysis) and suffer from initialization instability or massive memory usage. To address these limitations, the novel R...
Go to contribution page -
Daisuke Ichikawa (Kibaroku), Koji Makiyama (HOXO-M Inc.), Shinichi Takayanagi, kazuyuki sano08/07/2026, 16:00Poster
TheseusPlot is an R package for explaining why a rate metric (e.g., conversion rate, retention rate, or on-time rate) differs between two groups, such as time periods, cohorts, or A/B variants. The package decomposes an overall difference into contributions from individual subgroups using a procedure inspired by the Ship of Theseus: starting from Group A, it replaces subgroup data with the...
Go to contribution page -
Leila Kianmehr08/07/2026, 16:00Poster
Autosomal Dominant Polycystic Kidney Disease (ADPKD), the most common hereditary kidney disease, exhibits marked clinical heterogeneity driven by complex molecular mechanisms. While single-omics studies identify isolated pathways, defining the coordinated mechanistic framework of disease remains a challenge. In this study, we present an end-to-end R-based workflow to integrate high-throughput...
Go to contribution page -
Dr Wang Pok Lo (University of Oxford)08/07/2026, 16:00Poster
Simulation studies allow comparisons of performance between statistical methods to be made. Tables are traditionally used to report study results, which are usually performance measures such as bias, empirical standard error, average model standard error and coverage. In large simulation studies, these tables of results may become too large for patterns to be readily identified. This occurs...
Go to contribution page
Choose timezone
Your profile timezone: