# GraphAr PySpark (under development) This directory contains the code and build system for the GraphAr PySpark library. Library is implemented as bindings to GraphAr Scala Spark library and does not contain any real logic. ## Introduction GraphAr PySpark project provides a PySpark API and utilities for working with GraphAr file format from PySpark. The project has the only python dependency -- `pyspark` itself. Currently only `pysaprk~=3.2` is supported, but in the future the scope of supported versions will be extended. ## Installation Currently, the only way to install `graphar_pyspark` is to build it from the source code. The project is made with poetry, so it highly recommended to use this building system. ```shell poetry install ``` It creates a `tar.gz` file in `dist` directory. ## Generating API documentation To generate API documentation, run the following command: ```shell poetry install --with=docs poetry run pdoc -t ./template --output-dir ./docs graphar_pyspark ``` The documentation will be generated in the `docs` directory.