We design Big Data infrastructures to
enable efficient, open and reproducible
New pre-print: Reducing numerical precision preserves classification accuracy in Mondrian Forests
New pre-print: Modeling the Linux page cache for accurate simulation of data-intensive applications
New pre-print: Data Augmentation Through Monte Carlo Arithmetic Leads to More Generalizable Classification in Connectomics
New pre-print: An Analysis of Security Vulnerabilities in Container Images for Scientific Data Analysis
New pre-print: Numerical Instabilities in Analytical Pipelines Lead to Large and Meaningful Variability in Brain Networks.
New pre-print: A benchmark of data stream classification for human activity recognition on connected objects.
New pre-print: Can we Estimate Truck Accident Risk from Telemetric Data using Machine Learning?
New pre-print: File-based localization of numerical perturbations in data analysis pipelines.
New pre-print: Performance benefits of Intel® Optane™ DC persistent memory for the parallel processing of large neuroimaging data.