r-modes
|
public |
Designed with a dual purpose of accurately estimating the mode (or modes) as well as characterizing the modality of data. The specific application area includes complex or mixture distributions particularly in a big data environment. The heterogeneous nature of (big) data may require deep introspective statistical and machine learning techniques, but these statistical tools often fail when applied without first understanding the data. In small datasets, this often isn't a big issue, but when dealing with large scale data analysis or big data thoroughly inspecting each dimension typically yields an O(n^n-1) problem. As such, dealing with big data require an alternative toolkit. This package not only identifies the mode or modes for various data types, it also provides a programmatic way of understanding the modality (i.e. unimodal, bimodal, etc.) of a dataset (whether it's big data or not). See <http://www.sdeevi.com/modes_package> for examples and discussion.
|
2025-04-22 |
xtensor-r
|
public |
R bindings for xtensor, the C++ tensor algebra library
|
2025-04-22 |
r-skynet
|
public |
A flexible tool that allows generating bespoke air transport statistics for urban studies based on publicly available data from the Bureau of Transport Statistics (BTS) in the United States <https://www.transtats.bts.gov/databases.asp?Mode_ID=1&Mode_Desc=Aviation&Subject_ID2=0>.
|
2025-04-22 |
r-tree
|
public |
Classification and regression trees.
|
2025-04-22 |
tsl_robin_map
|
public |
C++ fast hash map and hash set using robin hood hashing
|
2025-04-22 |
r-refgenome
|
public |
Contains functionalities for importing and managing downloaded genome annotation data from 'Ensembl' genome browser (European Bioinformatics Institute, <https://www.ensembl.org>) and from 'UCSC' genome browser (University of California, Santa Cruz, <https://genome.ucsc.edu/>) and annotation routines for genomic positions and splice site positions.
|
2025-04-22 |
pyctest
|
public |
Python bindings to select portions of CMake/CTest package -- enabling generation of CTests for CDash submission without a CMake build system
|
2025-04-22 |
tsl_hopscotch_map
|
public |
C++ hash map and hash set using hopscotch hashing
|
2025-04-22 |
r-objectsignals
|
public |
A mutable Signal object can report changes to its state, clients could register functions so that they are called whenever the signal is emited. The signal could be emited, disconnected, blocked, unblocked, and buffered.
|
2025-04-22 |
r-wdi
|
public |
Search, extract and format data from the World Bank's World Development Indicators.
|
2025-04-22 |
datefudge
|
public |
Fake the system date
|
2025-04-22 |
r-expsmooth
|
public |
Data sets from the book "Forecasting with exponential smoothing: the state space approach" by Hyndman, Koehler, Ord and Snyder (Springer, 2008).
|
2025-04-22 |
r-fma
|
public |
All data sets from "Forecasting: methods and applications" by Makridakis, Wheelwright & Hyndman (Wiley, 3rd ed., 1998) <https://robjhyndman.com/forecasting/>.
|
2025-04-22 |
r-kableextra
|
public |
Build complex HTML or 'LaTeX' tables using 'kable()' from 'knitr' and the piping syntax from 'magrittr'. Function 'kable()' is a light weight table generator coming from 'knitr'. This package simplifies the way to manipulate the HTML or 'LaTeX' codes generated by 'kable()' and allows users to construct complex tables and customize styles using a readable syntax.
|
2025-04-22 |
r-multiway
|
public |
Fits multi-way component models via alternating least squares algorithms with optional constraints. Fit models include N-way Canonical Polyadic Decomposition, Individual Differences Scaling, Multiway Covariates Regression, Parallel Factor Analysis (1 and 2), Simultaneous Component Analysis, and Tucker Factor Analysis.
|
2025-04-22 |
r-cmls
|
public |
Solves multivariate least squares (MLS) problems subject to constraints on the coefficients, e.g., non-negativity, orthogonality, equality, inequality, monotonicity, unimodality, smoothness, etc. Includes flexible functions for solving MLS problems subject to user-specified equality and/or inequality constraints, as well as a wrapper function that implements 24 common constraint options. Also does k-fold or generalized cross-validation to tune constraint options for MLS problems. See ten Berge (1993, ISBN:9789066950832) for an overview of MLS problems, and see Goldfarb and Idnani (1983) <doi:10.1007/BF02591962> for a discussion of the underlying quadratic programming algorithm.
|
2025-04-22 |
r-rtensor
|
public |
A set of tools for creation, manipulation, and modeling of tensors with arbitrary number of modes. A tensor in the context of data analysis is a multidimensional array. rTensor does this by providing a S4 class 'Tensor' that wraps around the base 'array' class. rTensor provides common tensor operations as methods, including matrix unfolding, summing/averaging across modes, calculating the Frobenius norm, and taking the inner product between two tensors. Familiar array operations are overloaded, such as index subsetting via '[' and element-wise operations. rTensor also implements various tensor decomposition, including CP, GLRAM, MPCA, PVD, and Tucker. For tensors with 3 modes, rTensor also implements transpose, t-product, and t-SVD, as defined in Kilmer et al. (2013). Some auxiliary functions include the Khatri-Rao product, Kronecker product, and the Hadamard product for a list of matrices.
|
2025-04-22 |
r-threeway
|
public |
Component analysis for three-way data arrays by means of Candecomp/Parafac, Tucker3, Tucker2 and Tucker1 models.
|
2025-04-22 |
r-rgtk2
|
public |
Facilities in the R language for programming graphical interfaces using Gtk, the Gimp Tool Kit.
|
2025-04-22 |
r-smoothwin
|
public |
The main function in the package utilizes a windowing function in the form of an exponential weighting function to linear models. The bandwidth and sharpness of the window are controlled by two parameters. Then, a series of tests are used to identify the right parameters of the window (see Hamed Haselimashhadi et al (2019) <https://www.biorxiv.org/content/10.1101/656678v1>).
|
2025-04-22 |
r-emdist
|
public |
Package providing calculation of Earth Mover's Distance (EMD).
|
2025-04-22 |
r-rpmm
|
public |
Recursively Partitioned Mixture Model for Beta and Gaussian Mixtures. This is a model-based clustering algorithm that returns a hierarchy of classes, similar to hierarchical clustering, but also similar to finite mixture models.
|
2025-04-22 |
r-maptpx
|
public |
Posterior maximization for topic models (LDA) in text analysis, as described in Taddy (2012) `on estimation and selection for topic models'. Previous versions of this code were included as part of the textir package. If you want to take advantage of openmp parallelization, uncomment the relevant flags in src/MAKEVARS before compiling.
|
2025-04-22 |
r-catnet
|
public |
Structure learning and parameter estimation of discrete Bayesian networks using likelihood-based criteria. Exhaustive search for fixed node orders and stochastic search of optimal orders via simulated annealing algorithm are implemented.
|
2025-04-22 |
r-cdft
|
public |
This package proposes a statistical downscaling method for cumulative distribution functions (CDF), as well as the computation of the Cram\`er-von Mises statistics U, and the Kolmogorov-Smirnov statistics KS.
|
2025-04-22 |