×

A set of scikit-learn-style transformers for encoding categorical variables into numeric with different techniques. While ordinal, one-hot, and hashing encoders have similar equivalents in the existing scikit-learn version, the transformers in this library all share a few useful properties: - First-class support for pandas dataframes as an input (and optionally as output) - Can explicitly configure which columns in the data are encoded by name or index, or infer non-numeric columns regardless of input type - Can drop any columns with very low variance based on training set optionally - Portability: train a transformer on data, pickle it, reuse it later and get the same thing out. - Full compatibility with sklearn pipelines, input an array-like dataset like any other transformer

Uploaded Mon Mar 31 20:58:55 2025
md5 checksum 75beb30b48d4d343086fa588e36c666a
arch x86_64
build py312h06a4308_0
depends numpy >=1.14.0,<2.0a0, pandas >=1.0.5, patsy >=0.5.1, python >=3.12,<3.13.0a0, scikit-learn >=0.20.0, scipy >=1.0.0, statsmodels >=0.9.0
license BSD-3-Clause
license_family BSD
md5 75beb30b48d4d343086fa588e36c666a
name category_encoders
platform linux
sha1 be9d079e3224bdd3e12e6c4c552046d3ef5813aa
sha256 a7f5b290dd416041c91e3123eff790604331bdba49deaf1335344e1fcfdb47c6
size 129300
subdir linux-64
timestamp 1707357794859
version 2.6.3