-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve performance #28
Conversation
Replace with index_add.
Timing after removing a for loop in the short-range part:
|
Nice! Should we put a profiling example showing some breakdowns? Similar to https://luthaf.fr/rascaline/latest/examples/profiling.html maybe? |
I have the outputs on hand, but they're rather unwieldy, and not particularly informative -- maybe we can try to do something polished once this is done. |
Avoid some big multiplications.
Just for future reference, here are the timings for energy + forces. "Before" is
|
Test pass on |
This is to track progress on removing "obvious" bottlenecks to improve performance.
Timing before starting work (for cutoff=10, 96 atoms ZrO2, script attached -- timings with profiler, so slower than real):
profile.zip
📚 Documentation preview 📚: https://meshlode--28.org.readthedocs.build/en/28/