r-baseset
|
public |
Implements a class and methods to work with sets, doing intersection, union, complementary sets, power sets, cartesian product and other set operations in a "tidy" way. These set operations are available for both classical sets and fuzzy sets. Import sets from several formats or from other several data structures.
|
2025-09-22 |
r-plotluck
|
public |
Examines the characteristics of a data frame and a formula to automatically choose the most suitable type of plot out of the following supported options: scatter, violin, box, bar, density, hexagon bin, spine plot, and heat map. The aim of the package is to let the user focus on what to plot, rather than on the "how" during exploratory data analysis. It also automates handling of observation weights, logarithmic axis scaling, reordering of factor levels, and overlaying smoothing curves and median lines. Plots are drawn using 'ggplot2'.
|
2025-09-22 |
r-mlgt
|
public |
Processing and analysis of high throughput (Roche 454) sequences generated from multiple loci and multiple biological samples. Sequences are assigned to their locus and sample of origin, aligned and trimmed. Where possible, genotypes are called and variants mapped to known alleles.
|
2025-09-22 |
r-oriclust
|
public |
ORIClust is a user-friendly R-based software package for gene clustering. Clusters are given by genes matched to prespecified profiles across various ordered treatment groups. It is particularly useful for analyzing data obtained from short time-course or dose-response microarray experiments.
|
2025-09-22 |
terraform-provider-aws
|
public |
The Terraform AWS provider
|
2025-09-22 |
numcosmo
|
public |
NumCosmo is a free software C library whose main purposes are to test cosmological models using observational data and to provide a set of tools to perform cosmological calculations.
|
2025-09-22 |
posthog
|
public |
Integrate PostHog into any python application
|
2025-09-22 |
r-kriging
|
public |
Simple and highly optimized ordinary kriging algorithm to plot geographical data
|
2025-09-22 |
llama-cloud-services
|
public |
Tailored SDK clients for LlamaCloud services.
|
2025-09-22 |
mne-lsl
|
public |
Real-time framework integrated with MNE-Python for online neuroscience research through LSL-compatible devices.
|
2025-09-22 |
r-densityclust
|
public |
An improved implementation (based on k-nearest neighbors) of the density peak clustering algorithm, originally described by Alex Rodriguez and Alessandro Laio (Science, 2014 vol. 344). It can handle large datasets (> 100, 000 samples) very efficiently. It was initially implemented by Thomas Lin Pedersen, with inputs from Sean Hughes and later improved by Xiaojie Qiu to handle large datasets with kNNs.
|
2025-09-22 |
r-udpipe
|
public |
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser. Namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns) sentiment scoring and semantic similarity analysis.
|
2025-09-22 |
r-lokern
|
public |
Kernel regression smoothing with adaptive local or global plug-in bandwidth selection.
|
2025-09-22 |
r-anomalydetection
|
public |
Implements procedures developed by Gutierrez et al. (2017, <https://journal.r-project.org/archive/2017/RJ-2017-039/index.html>) to aid in detecting network log anomalies. By combining various multivariate analytic approaches relevant to network anomaly detection, it provides cyber analysts efficient means to detect suspected anomalies requiring further evaluation.
|
2025-09-22 |
ome-zarr-models
|
public |
A minimal Python package for reading OME-Zarr (meta)data
|
2025-09-22 |
curated-tokenizers
|
public |
Lightweight piece tokenization library
|
2025-09-22 |
hyp3_sdk
|
public |
A python wrapper around the HyP3 API
|
2025-09-22 |
r-topicmodels
|
public |
Provides an interface to the C code for Latent Dirichlet Allocation (LDA) models and Correlated Topics Models (CTM) by David M. Blei and co-authors and the C++ code for fitting LDA models using Gibbs sampling by Xuan-Hieu Phan and co-authors.
|
2025-09-22 |
r-turfr
|
public |
Package for analyzing TURF (Total Unduplicated Reach and Frequency) data in R. No looping in TURF algorithm results in fast processing times. Allows for individual-level weights, depth specification, and user-truncated combination set(s). Allows user to substitute Monte Carlo simulated combination set(s) after set(s) exceed a user-specified limit.
|
2025-09-22 |
r-treeheatr
|
public |
Creates interpretable decision tree visualizations with the data represented as a heatmap at the tree's leaf nodes. 'treeheatr' utilizes the customizable 'ggparty' package for drawing decision trees.
|
2025-09-22 |
r-coranking
|
public |
Calculates the co-ranking matrix to assess the quality of a dimensionality reduction.
|
2025-09-22 |
r-optm
|
public |
The popular population genetic software 'Treemix' by 'Pickrell and Pritchard' (2012) <DOI:10.1371/journal.pgen.1002967> estimates the number of migration edges on a population tree. However, it can be difficult to determine the number of migration edges to include. Previously, it was customary to stop adding migration edges when 99.8% of variation in the data was explained, but 'OptM' automates this process using an ad hoc statistic based on the second-order rate of change in the log likelihood. 'OptM' also has added functionality for various threshold modeling to compare with the ad hoc statistic.
|
2025-09-22 |
r-mvn
|
public |
Performs multivariate normality tests and graphical approaches and implements multivariate outlier detection and univariate normality of marginal distributions through plots and tests, and performs multivariate Box-Cox transformation (Korkmaz et al, (2014), <https://journal.r-project.org/archive/2014-2/korkmaz-goksuluk-zararsiz.pdf>).
|
2025-09-22 |
cargo-c
|
public |
build and install C-compatible libraries
|
2025-09-22 |
r-msos
|
public |
Multivariate Analysis methods and data sets used in John Marden's book Multivariate Statistics: Old School (2015) <ISBN:978-1456538835>. This also serves as a companion package for the STAT 571: Multivariate Analysis course offered by the Department of Statistics at the University of Illinois at Urbana-Champaign ('UIUC').
|
2025-09-22 |