Releases: BaseModelAI/cleora

v2.0.0

24 Nov 21:52
3cc300d

Cleora is now available as a Python package, pycleora. Key improvements over the previous version:

  • performance optimizations: ~10x faster embedding times
  • performance optimizations: significantly reduced memory usage
  • latest research: improved embedding quality
  • new feature: can create graphs from a Python iterator in addition to tsv files
  • new feature: seamless integration with NumPy
  • new feature: item attributes support via custom embeddings initialization
  • new feature: adjustable vector projection / normalization after each propagation step
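
The last item, a projection or normalization applied after each propagation step, can be illustrated without pycleora itself. The following is a minimal pure-Python sketch, assuming a toy row-normalized transition matrix and L2 normalization as the chosen projection. The names `propagate` and `l2_normalize` and the example data are hypothetical and are not part of the pycleora API.

```python
import math

def propagate(transition, embeddings):
    """One propagation step: multiply a dense transition matrix by the embeddings."""
    dim = len(embeddings[0])
    return [
        [sum(w * embeddings[j][d] for j, w in enumerate(row)) for d in range(dim)]
        for row in transition
    ]

def l2_normalize(embeddings):
    """Scale every vector to unit L2 norm (zero vectors are left unchanged)."""
    out = []
    for vec in embeddings:
        norm = math.sqrt(sum(x * x for x in vec))
        out.append([x / norm for x in vec] if norm > 0 else list(vec))
    return out

# Toy path graph a - b - c; each row sums to 1 (assumed example data).
transition = [
    [0.0, 1.0, 0.0],
    [0.5, 0.0, 0.5],
    [0.0, 1.0, 0.0],
]
embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Propagate, then normalize, a few times. Swapping l2_normalize for another
# projection is the "adjustable" part this sketch is meant to show.
for _ in range(3):
    embeddings = l2_normalize(propagate(transition, embeddings))
```

Keeping the normalization as a separate function makes it easy to replace with a different projection, or to skip it entirely, between steps.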

Breaking changes:

  • the transient modifier is no longer supported. Creating complex::reflexive columns for hypergraph embeddings, grouped by the transient entity, gives better results.
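
One way to read this migration note: instead of marking a column as transient, the values that co-occur with each transient entity are collected into one space-separated group, and that group is embedded as a single complex::reflexive column. The following is a minimal sketch, assuming (customer, product) pairs in which the customer is the former transient entity. The data and the name `group_by_transient` are hypothetical.

```python
from collections import defaultdict

def group_by_transient(pairs):
    """Collect items per transient entity and emit one hyperedge string each."""
    groups = defaultdict(list)
    for transient, item in pairs:
        groups[transient].append(item)
    # The transient id itself is dropped; only the grouped items remain.
    return [" ".join(items) for items in groups.values()]

# Assumed example data: (customer, product) interactions.
pairs = [
    ("u1", "p1"),
    ("u1", "p2"),
    ("u2", "p2"),
    ("u2", "p3"),
    ("u1", "p3"),
]
hyperedges = group_by_transient(pairs)
```

Each resulting string would then serve as one input line, or one element of a Python iterator, for a column declared as complex::reflexive.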

v1.2.3

29 Jun 14:56
4810af4

Changed

  • Bump libs (#60).

Fixed

  • Check for malformed lines in input (#59).

v1.2.2

24 Jun 13:20
56269cd

Changed

  • Allow cleora to accept multiple input files as positional args. The named argument 'input' is being deprecated (#55).

v1.2.1

13 Apr 10:04
4d85152

Changed

  • Optimize "--output-format numpy" mode, so it doesn't require additional memory when writing output file (#50).
  • Bump libs (#52).

v1.2.0

17 Mar 16:52

Added

  • Use default hasher for vector init (#47).

v1.1.1

14 May 12:58
d78a53b

Added

  • Init embedding with seed during training (#27).
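
Seeded initialization makes the starting vectors reproducible for a given seed. The following is a minimal sketch of the general idea, assuming each component is derived from a hash of (seed, entity, dimension index) and mapped to roughly [-1, 1]. The hash function and the name `init_vector` are illustrative assumptions, not Cleora's actual implementation.

```python
import hashlib
import struct

def init_vector(entity, dim, seed=0):
    """Build a reproducible starting vector for an entity label."""
    vec = []
    for i in range(dim):
        # Hash the seed, entity label and component index together.
        digest = hashlib.sha256(f"{seed}:{entity}:{i}".encode("utf-8")).digest()
        # Interpret the first 8 bytes as an unsigned 64-bit integer.
        (value,) = struct.unpack("<Q", digest[:8])
        # Map the integer range onto roughly [-1, 1].
        vec.append(value / 2**63 - 1.0)
    return vec

same_a = init_vector("product_42", 4, seed=1)
same_b = init_vector("product_42", 4, seed=1)
other = init_vector("product_42", 4, seed=2)
```

The same label and seed always yield the same vector, while a different seed yields a different starting point.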

Cleora v1.1.0

23 Dec 18:07
ded180a

Changed

  • Bumped env_logger to 0.8.2, smallvec to 1.5.1, removed fnv hasher (#11).

Added

  • Tests (snapshots) for in-memory and memory-mapped files calculations of embeddings (#12).
  • Support for NumPy output format (available via --output-format program argument) (#15).
  • Jupyter notebooks with experiments (#16).

Improved

  • Used vector for hash_to_id mappings, non-allocating cartesian product, ryu crate for faster write (#13).
  • Sparse Matrix refactor (cleanup, simplification, using iter, speedup). Use Cargo.toml data for clap crate (#17).
  • Unify and simplify embeddings calculation for in-memory and mmap matrices (#18).

Cleora v1.0.1

23 Nov 16:37

Fixed

  • Skip invalid UTF-8 lines when reading input (#8).
  • Fix clippy warnings (#7).

Added

  • JSON support (#3).
  • Snapshot testing (#5).

Cleora v1.0.0

23 Nov 08:55
7692d67

Initial release.