r-xslt
|
public |
An extension for the 'xml2' package to transform XML documents by applying an 'xslt' style-sheet.
|
2025-04-22 |
r-xptr
|
public |
There is limited native support for external pointers in the R interface. This package provides some basic tools to verify, create and modify 'externalptr' objects.
|
2025-04-22 |
r-xnomial
|
public |
Tests whether a set of counts fit a given expected ratio. For example, a genetic cross might be expected to produce four types in the relative frequencies of 9:3:3:1. To see whether a set of observed counts fits this expectation, one can examine all possible outcomes with xmulti() or a random sample of them with xmonte() and find the probability of an observation deviating from the expectation by at least as much as the observed. As a measure of deviation from the expected, one can use the log-likelihood ratio, the multinomial probability, or the classic chi-square statistic. A histogram of the test statistic can also be plotted and compared with the asymptotic curve.
|
2025-04-22 |
r-xgboost
|
public |
Extreme Gradient Boosting, which is an efficient implementation of the gradient boosting framework from Chen & Guestrin (2016) <doi:10.1145/2939672.2939785>. This package is its R interface. The package includes efficient linear model solver and tree learning algorithms. The package can automatically do parallel computation on a single machine which could be more than 10 times faster than existing gradient boosting packages. It supports various objective functions, including regression, classification and ranking. The package is made to be extensible, so that users are also allowed to define their own objectives easily.
|
2025-04-22 |
r-xbrl
|
public |
Functions to extract business financial information from an Extensible Business Reporting Language ('XBRL') instance file and the associated collection of files that defines its 'Discoverable' Taxonomy Set ('DTS').
|
2025-04-22 |
r-wwr
|
public |
Calculate the (weighted) win loss statistics including the win ratio, win difference and win product and their variances, with which the p-values are also calculated. The variance estimation is based on Luo et al. (2015) <doi:10.1111/biom.12225> and Luo et al. (2017) <doi:10.1002/sim.7284>. This package also calculates general win loss statistics with user-specified win loss function with variance estimation based on Bebu and Lachin (2016) <doi:10.1093/biostatistics/kxv032>. This version corrected an error when outputting confidence interval for win difference.
|
2025-04-22 |
r-wtest
|
public |
Perform the calculation of W-test, diagnostic checking, calculate minor allele frequency (MAF) and odds ratio.
|
2025-04-22 |
r-wsrf
|
public |
A parallel implementation of Weighted Subspace Random Forest. The Weighted Subspace Random Forest algorithm was proposed in the International Journal of Data Warehousing and Mining by Baoxun Xu, Joshua Zhexue Huang, Graham Williams, Qiang Wang, and Yunming Ye (2012) <DOI:10.4018/jdwm.2012040103>. The algorithm can classify very high-dimensional data with random forests built using small subspaces. A novel variable weighting method is used for variable subspace selection in place of the traditional random variable sampling.This new approach is particularly useful in building models from high-dimensional data.
|
2025-04-22 |
r-wskm
|
public |
Entropy weighted k-means (ewkm) is a weighted subspace clustering algorithm that is well suited to very high dimensional data. Weights are calculated as the importance of a variable with regard to cluster membership. The two-level variable weighting clustering algorithm tw-k-means (twkm) introduces two types of weights, the weights on individual variables and the weights on variable groups, and they are calculated during the clustering process. The feature group weighted k-means (fgkm) extends this concept by grouping features and weighting the group in addition to weighting individual features.
|
2025-04-22 |
r-wrswor
|
public |
A collection of implementations of classical and novel algorithms for weighted sampling without replacement.
|
2025-04-22 |
r-writexl
|
public |
Zero-dependency data frame to xlsx exporter based on 'libxlsxwriter'. Fast and no Java or Excel required.
|
2025-04-22 |
r-wrassp
|
public |
A wrapper around Michel Scheffers's 'libassp' (<http://libassp.sourceforge.net/>). The 'libassp' (Advanced Speech Signal Processor) library aims at providing functionality for handling speech signal files in most common audio formats and for performing analyses common in phonetic science/speech science. This includes the calculation of formants, fundamental frequency, root mean square, auto correlation, a variety of spectral analyses, zero crossing rate, filtering etc. This wrapper provides R with a large subset of 'libassp's signal processing functions and provides them to the user in a (hopefully) user-friendly manner.
|
2025-04-22 |
r-wpkde
|
public |
Weighted Piecewise Kernel Density Estimation for large data.
|
2025-04-22 |
r-wordcloud
|
public |
Functionality to create pretty word clouds, visualize differences and similarity between documents, and avoid over-plotting in scatter plots with text.
|
2025-04-22 |
r-wlreg
|
public |
Use various regression models for the analysis of win loss endpoints adjusting for non-binary and multivariate covariates.
|
2025-04-22 |
r-wingui
|
public |
Helps for interfacing with the operating system particularly for Windows.
|
2025-04-22 |
r-wicket
|
public |
Utilities to generate bounding boxes from 'WKT' (Well-Known Text) objects and R data types, validate 'WKT' objects and convert object types from the 'sp' package into 'WKT' representations.
|
2025-04-22 |
r-whopgenome
|
public |
Provides very fast access to whole genome, population scale variation data from VCF files and sequence data from FASTA-formatted files. It also reads in alignments from FASTA, Phylip, MAF and other file formats. Provides easy-to-use interfaces to genome annotation from UCSC and Bioconductor and gene ontology data from AmiGO and is capable to read, modify and write PLINK .PED-format pedigree files.
|
2025-04-22 |
r-wfe
|
public |
Provides a computationally efficient way of fitting weighted linear fixed effects estimators for causal inference with various weighting schemes. Weighted linear fixed effects estimators can be used to estimate the average treatment effects under different identification strategies. This includes stratified randomized experiments, matching and stratification for observational studies, first differencing, and difference-in-differences. The package implements methods described in Imai and Kim (2017) "When should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?", available at <https://imai.fas.harvard.edu/research/FEmatch.html>.
|
2025-04-22 |
r-weibullr
|
public |
Life data analysis in the graphical tradition of Waloddi Weibull. Methods derived from Robert B. Abernethy (2008, ISBN 0-965306-3-2), Wayne Nelson (1982, ISBN: 9780471094586) <DOI:10.1002/0471725234>, William Q. Meeker and Lois A. Escobar (1998, ISBN: 1-471-14328-6), John I. McCool, (2012, ISBN: 9781118217986) <DOI:10.1002/9781118351994>.
|
2025-04-22 |
r-pullword
|
public |
R Interface to Pullword Service for natural language processing in Chinese. It enables users to extract valuable words from text by deep learning models. For more details please visit the official site (in Chinese) http://pullword.com/.
|
2025-04-22 |
r-ptw
|
public |
Parametric Time Warping aligns patterns, i.e. it aims to put corresponding features at the same locations. The algorithm searches for an optimal polynomial describing the warping. It is possible to align one sample to a reference, several samples to the same reference, or several samples to several references. One can choose between calculating individual warpings, or one global warping for a set of samples and one reference. Two optimization criteria are implemented: RMS (Root Mean Square error) and WCC (Weighted Cross Correlation). Both warping of peak profiles and of peak lists are supported.
|
2025-04-22 |
r-ptsuite
|
public |
Various estimation methods for the shape parameter of Pareto distributed data. This package contains functions for various estimation methods such as maximum likelihood (Newman, 2005)<doi:10.1016/j.cities.2012.03.001>, Hill's estimator (Hill, 1975)<doi:10.1214/aos/1176343247>, least squares (Zaher et al., 2014)<doi:10.9734/BJMCS/2014/10890>, method of moments (Rytgaard, 1990)<doi:10.2143/AST.20.2.2005443>, percentiles (Bhatti et al., 2018)<doi:10.1371/journal.pone.0196456>, and weighted least squares (Nair et al., 2019) to estimate the shape parameter of Pareto distributed data. It also provides both a heuristic method (Hubert et al., 2013)<doi:10.1016/j.csda.2012.07.011> and a goodness of fit test (Gulati and Shapiro, 2008)<doi:10.1007/978-0-8176-4619-6> for testing for Pareto data as well as a method for generating Pareto distributed data.
|
2025-04-22 |
r-pts2polys
|
public |
Various applications in invasive species biology, conservation biology, epidemiology and elsewhere involve sampling of sets of 2D points from a posterior distribution. The number of such point sets may be large, say 1000 or 10000. This package facilitates visualisation of such output by constructing seven nested polygons representing the location and variability of the point sets. This can be used, for example, to visualise the range boundary of a species, and uncertainty in the location of that boundary.
|
2025-04-22 |
r-ptest
|
public |
Implements p-value computations using an approximation to the cumulative distribution function for a variety of tests for periodicity. These tests include harmonic regression tests with normal and double exponential errors as well as modifications of Fisher's g test. An accompanying vignette illustrates the application of these tests.
|
2025-04-22 |