r-contrast
|
public |
One degree of freedom contrasts for 'lm', 'glm', 'gls', and 'geese' objects.
|
2025-09-22 |
r-gpca
|
public |
This package implements guided principal components analysis for the detection of batch effects in high-throughput data.
|
2025-09-22 |
stir
|
public |
Software for Tomographic Image Reconstruction
|
2025-09-22 |
r-baguette
|
public |
Tree- and rule-based models can be bagged using this package and their predictions equations are stored in an efficient format to reduce the model objects size and speed.
|
2025-09-22 |
r-fdboost
|
public |
Regression models for functional data, i.e., scalar-on-function, function-on-scalar and function-on-function regression models, are fitted by a component-wise gradient boosting algorithm.
|
2025-09-22 |
r-ssc
|
public |
Provides a collection of self-labeled techniques for semi-supervised classification. In semi-supervised classification, both labeled and unlabeled data are used to train a classifier. This learning paradigm has obtained promising results, specifically in the presence of a reduced set of labeled examples. This package implements a collection of self-labeled techniques to construct a classification model. This family of techniques enlarges the original labeled set using the most confident predictions to classify unlabeled data. The techniques implemented can be applied to classification problems in several domains by the specification of a supervised base classifier. At low ratios of labeled data, it can be shown to perform better than classical supervised classifiers.
|
2025-09-22 |
flake8-black
|
public |
Extension for flake8 for validating Python code style using black
|
2025-09-22 |
r-rslurm
|
public |
Functions that simplify submitting R scripts to a 'Slurm' workload manager, in part by automating the division of embarrassingly parallel calculations across cluster nodes.
|
2025-09-22 |
r-snpls
|
public |
Tools for performing variable selection in three-way data using N-PLS in combination with L1 penalization, Selectivity Ratio and VIP scores. The N-PLS model (Rasmus Bro, 1996 <DOI:10.1002/(SICI)1099-128X(199601)10:1%3C47::AID-CEM400%3E3.0.CO;2-C>) is the natural extension of PLS (Partial Least Squares) to N-way structures, and tries to maximize the covariance between X and Y data arrays. The package also adds variable selection through L1 penalization, Selectivity Ratio and VIP scores.
|
2025-09-22 |
r-coursekata
|
public |
Easily install and load all packages and functions used in 'CourseKata' courses. Aid teaching with helper functions and augment generic functions to provide cohesion between the network of packages. Learn more about 'CourseKata' at <https://coursekata.org>.
|
2025-09-22 |
uproot
|
public |
ROOT I/O in pure Python and NumPy.
|
2025-09-21 |
r-openair
|
public |
Tools to analyse, interpret and understand air pollution data. Data are typically hourly time series and both monitoring data and dispersion model output can be analysed. Many functions can also be applied to other data, including meteorological and traffic data.
|
2025-09-21 |
r-polca
|
public |
Latent class analysis and latent class regression models for polytomous outcome variables. Also known as latent structure analysis.
|
2025-09-21 |
py-multibase
|
public |
Multibase implementation for Python
|
2025-09-21 |
py-multicodec
|
public |
Multicodec implementation in Python
|
2025-09-21 |
r-mbsp
|
public |
Gibbs sampler for fitting multivariate Bayesian linear regression with shrinkage priors (MBSP), using the three parameter beta normal family. The method is described in Bai and Ghosh (2018) <doi:10.1016/j.jmva.2018.04.010>.
|
2025-09-21 |
pymultihash
|
public |
Python implementation of the multihash specification
|
2025-09-21 |
r-clarify
|
public |
Performs simulation-based inference as an alternative to the delta method for obtaining valid confidence intervals and p-values for regression post-estimation quantities, such as average marginal effects and predictions at representative values. This framework for simulation-based inference is especially useful when the resulting quantity is not normally distributed and the delta method approximation fails. The methodology is described in King, Tomz, and Wittenberg (2000) <doi:10.2307/2669316>. 'clarify' is meant to replace some of the functionality of the archived package 'Zelig'; see the vignette "Translating Zelig to clarify" for replicating this functionality.
|
2025-09-21 |
restic
|
public |
Restic is a fast and secure backup program.
|
2025-09-21 |
r-lime
|
public |
When building complex models, it is often difficult to explain why the model should be trusted. While global measures such as accuracy are useful, they cannot be used for explaining why a model made a specific prediction. 'lime' (a port of the 'lime' 'Python' package) is a method for explaining the outcome of black box models by fitting a local model around the point in question an perturbations of this point. The approach is described in more detail in the article by Ribeiro et al. (2016) <arXiv:1602.04938>.
|
2025-09-21 |
r-dixontest
|
public |
For outlier detection in small and normally distributed samples the ratio test of Dixon (Q-test) can be used. Density, distribution function, quantile function and random generation for Dixon's ratio statistics are provided as wrapper functions. The core applies McBane's Fortran functions <doi:10.18637/jss.v016.i03> that use Gaussian quadrature for a numerical solution.
|
2025-09-21 |
r-gsalib
|
public |
This package contains utility functions used by the Genome Analysis Toolkit (GATK) to load tables and plot data. The GATK is a toolkit for variant discovery in high-throughput sequencing data.
|
2025-09-21 |
poetry-core
|
public |
Poetry PEP 517 Build Backend
|
2025-09-21 |
r-htree
|
public |
Historical regression trees are an extension of standard trees, producing a non-parametric estimate of how the response depends on all of its prior realizations as well as that of any time-varying predictor variables. The method applies equally to regularly as well as irregularly sampled data. The package implements random forest and boosting ensembles based on historical regression trees, suitable for longitudinal data. Standard error estimation and Z-score variable importance is also implemented.
|
2025-09-21 |
r-ocedata
|
public |
Several Oceanographic data sets are provided for use by the 'oce' package and for other purposes.
|
2025-09-21 |