pathspec
Utility library for gitignore style pattern matching of file paths.
Utility library for gitignore style pattern matching of file paths.
To install this package, run one of the following:
pathspec is a utility library for pattern matching of file paths. So
far this only includes Git's wildmatch pattern matching which itself is
derived from Rsync's wildmatch. Git uses wildmatch for its gitignore_
files.
.. _gitignore: http://git-scm.com/docs/gitignore
Say you have a "Projects" directory and you want to back it up, but only certain files, and ignore others depending on certain conditions::
>>> import pathspec
>>> # The gitignore-style patterns for files to select, but we're including
>>> # instead of ignoring.
>>> spec = """
...
... # This is a comment because the line begins with a hash: "#"
...
... # Include several project directories (and all descendants) relative to
... # the current directory. To reference a directory you must end with a
... # slash: "/"
... /project-a/
... /project-b/
... /project-c/
...
... # Patterns can be negated by prefixing with exclamation mark: "!"
...
... # Ignore temporary files beginning or ending with "~" and ending with
... # ".swp".
... !~*
... !*~
... !*.swp
...
... # These are python projects so ignore compiled python files from
... # testing.
... !*.pyc
...
... # Ignore the build directories but only directly under the project
... # directories.
... !/*/build/
...
... """
We want to use the GitWildMatchPattern class to compile our patterns. The
PathSpec class provides an interface around pattern implementations::
>>> spec = pathspec.PathSpec.from_lines(pathspec.patterns.GitWildMatchPattern, spec.splitlines())
That may be a mouthful but it allows for additional patterns to be implemented
in the future without them having to deal with anything but matching the paths
sent to them. GitWildMatchPattern is the implementation of the actual
pattern which internally gets converted into a regular expression.
PathSpec is a simple wrapper around a list of compiled patterns.
To make things simpler, we can use the registered name for a pattern class
instead of always having to provide a reference to the class itself. The
GitWildMatchPattern class is registered as gitwildmatch::
>>> spec = pathspec.PathSpec.from_lines('gitwildmatch', spec.splitlines())
If we wanted to manually compile the patterns we can just do the following::
>>> patterns = map(pathspec.patterns.GitWildMatchPattern, spec.splitlines())
>>> spec = PathSpec(patterns)
PathSpec.from_lines() is simply a class method which does just that.
If you want to load the patterns from file, you can pass the file instance directly as well::
>>> with open('patterns.list', 'r') as fh:
>>> spec = pathspec.PathSpec.from_lines('gitignore', fh)
You can perform matching on a whole directory tree with::
>>> matches = spec.match_tree('path/to/directory')
Or you can perform matching on a specific set of file paths with::
>>> matches = spec.match_files(file_paths)
Or check to see if an individual file matches::
>>> is_matched = spec.match_file(file_path)
pathspec is licensed under the Mozilla Public License Version 2.0. See
LICENSE or the FAQ_ for more information.
In summary, you may use pathspec with any closed or open source project without affecting the license of the larger work so long as you:
give credit where credit is due,
and release any custom changes made to pathspec.
.. _Mozilla Public License Version 2.0: http://www.mozilla.org/MPL/2.0
.. _LICENSE: LICENSE
.. _FAQ: http://www.mozilla.org/MPL/2.0/FAQ.html
The source code for pathspec is available from the GitHub repo
cpburnz/python-path-specification_.
.. _cpburnz/python-path-specification: https://github.com/cpburnz/python-path-specification
pathspec requires the following packages:
setuptools_pathspec can be installed from source with::
python setup.py install
pathspec is also available for install through PyPI_::
pip install pathspec
.. _setuptools: https://pypi.python.org/pypi/setuptools
.. _PyPI: http://pypi.python.org/pypi/pathspec
Documentation for pathspec is available on Read the Docs_.
.. _Read the Docs: http://python-path-specification.readthedocs.io
pathspec is also available as a Ruby gem_.
.. _Ruby gem: https://github.com/highb/pathspec-ruby
Issue #22_: Handle dangling symlinks... _Issue #22: https://github.com/cpburnz/python-path-specification/issues/22
Issue #21_: Fix collections deprecation warning... _Issue #21: https://github.com/cpburnz/python-path-specification/issues/21
Issue #20_: Support current directory prefix... _Issue #20: https://github.com/cpburnz/python-path-specification/issues/20
Issue #17_: Add link to Ruby implementation of pathspec... _Issue #17: https://github.com/cpburnz/python-path-specification/pull/17
Issue #14_: Fix byte strings for Python 3.Issue #15_: Include "LICENSE" in source package.Issue #16_: Support Python 2.6... _Issue #14: https://github.com/cpburnz/python-path-specification/issues/14
.. _Issue #15: https://github.com/cpburnz/python-path-specification/pull/15
.. _Issue #16: https://github.com/cpburnz/python-path-specification/issues/16
Issue #13_: Add equality methods to PathSpec and RegexPattern... _Issue #13: https://github.com/cpburnz/python-path-specification/pull/13
Issue #12_: Add PathSpec.match_file().gitignore.GitIgnorePattern to patterns.gitwildmatch.GitWildMatchPattern.gitignore.GitIgnorePattern... _Issue #12: https://github.com/cpburnz/python-path-specification/issues/12
Issue #11_: Support converting patterns into regular expressions without compiling them.RegexPattern should implement pattern_to_regex()... _Issue #11: https://github.com/cpburnz/python-path-specification/issues/11
Issue #7_: Fixed non-recursive links.Issue #8_: Fixed edge cases in gitignore patterns.Issue #9_: Fixed minor usage documentation... _Issue #7: https://github.com/cpburnz/python-path-specification/pull/7
.. _Issue #8: https://github.com/cpburnz/python-path-specification/pull/8
.. _Issue #9: https://github.com/cpburnz/python-path-specification/pull/9
Issue #5_: Use tox for testing.Issue #6_: Fixed matching Windows paths.spec.match_tree() and spec.match_files() now return iterators instead of sets... _Issue #5: https://github.com/cpburnz/python-path-specification/pull/5
.. _Issue #6: https://github.com/cpburnz/python-path-specification/issues/6
Issue #3_: Fixed trailing slash in gitignore patterns.Issue #4_: Fixed test for trailing slash in gitignore patterns... _Issue #3: https://github.com/cpburnz/python-path-specification/pull/3
.. _Issue #4: https://github.com/cpburnz/python-path-specification/pull/4
Summary
Utility library for gitignore style pattern matching of file paths.
Last Updated
Oct 1, 2018 at 16:22
License
Mozilla Public License 2.0 (MPL 2.0)
Total Downloads
196
Supported Platforms