Skip to content

Commit

Permalink
add limitations to docs as per #158
Browse files Browse the repository at this point in the history
  • Loading branch information
brendan committed Jun 6, 2023
1 parent c268f94 commit 786a604
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 0 deletions.
1 change: 1 addition & 0 deletions docsrc/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -15,6 +15,7 @@ aims to serve as a general purpose python library for importing, analysing, mani
/examples
/phase_space_format
/supported_particles
/limitations
/code_docs

Indices and tables
Expand Down
7 changes: 7 additions & 0 deletions docsrc/limitations.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
# Limitations

The major limitation of this code at the time of writing is that it can only easily handle data that fits inside memory. This is partially a result of the choice of pandas as the backend - [this page](https://pandas.pydata.org/docs/user_guide/scale.html) describes some difficulties and solutions of handling large data with pandas.

As is discussed in the above link: if your data is too big to fit inside RAM, it should be possible to read and process your data in 'chunks' where each chunk can fit inside memory. This is not supported in most data loaders, but should be possible with minimal extensions - open an issue and we can talk about it!

Beyond this, libraries such as [DASK](https://www.dask.org/) may enable using this library of distributed resources. This is discussed a little bit in [this issue](https://github.com/bwheelz36/ParticlePhaseSpace/issues/158), with an example of utilising DASK on an OpenPMD dataset.

0 comments on commit 786a604

Please sign in to comment.