Pattern clustering


This tool clusters lines of text according to a collection of input patterns, each modeled as a regular expression.

This work has been published in:

[ICPR’2022] A novel pattern-based edit distance for automatic log parsing, Maxime Raynal, Marc-Olivier Buob, Georges Quénot.

Features

  • Forms groups of homogeneous lines using a pattern-based distance built on customizable patterns.

  • Configured by default to use common patterns (IP addresses, numeric values, etc.)
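
To illustrate the idea behind the features above, here is a minimal sketch using only Python's standard `re` module. It is not the pattern-clustering API and does not implement the edit distance from the paper: it replaces each pattern match with a placeholder and groups lines whose resulting templates are identical. The pattern names, the regular expressions, and the helpers `to_template` and `group_lines` are all assumptions made for this example.

```python
import re
from collections import defaultdict

# Hypothetical patterns for illustration only; the tool's real defaults may differ.
# Order matters: more specific patterns come first in the alternation.
PATTERNS = [
    ("ipv4", r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
    ("float", r"\b\d+\.\d+\b"),
    ("int", r"\b\d+\b"),
]

# One combined regex with a named group per pattern, applied in a single pass.
TOKEN_RE = re.compile("|".join(f"(?P<{name}>{rx})" for name, rx in PATTERNS))


def to_template(line):
    """Replace every pattern match with a placeholder such as <ipv4>."""
    return TOKEN_RE.sub(lambda m: f"<{m.lastgroup}>", line)


def group_lines(lines):
    """Group lines that share the same template (exact match, no distance)."""
    groups = defaultdict(list)
    for line in lines:
        groups[to_template(line)].append(line)
    return dict(groups)


if __name__ == "__main__":
    sample = [
        "Connection from 192.168.0.1 port 22",
        "Connection from 10.0.0.7 port 8080",
        "Disk usage at 91.5 percent",
    ]
    for template, members in group_lines(sample).items():
        print(template, len(members))
```

Exact template matching is a deliberate simplification: a pattern-based distance would also let lines with slightly different templates fall into the same cluster.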

License

This project is licensed under the BSD-3-Clause license; see the LICENSE file for details.

More about pattern-clustering

For more information, see the project wiki.

Acks