Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

High memory useage while running scheduled check for new notifcations #7

Open
AJAYK-01 opened this issue Mar 16, 2021 · 2 comments
Open
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@AJAYK-01
Copy link
Owner

AJAYK-01 commented Mar 16, 2021

Describe the bug:

When the APScheduler runs the function scheduledjob (line 70, bot.py), the RAM useage spikes up over time, reaching an unreasonable high value (upto 1GB useage in less than a day's use! ), thereby making the bot unhosteable on any of the free plans of common hosting providers.

To Reproduce:

Steps to reproduce the behavior:

  1. Create your own bot token and set up the bot as mentioned in the README instructions.
  2. Run python bot.py
  3. Open up your local memory useage monitor application (task manager) and monitor the memory useage of the bot.
  4. Wait for a few hours or just increase the frequency of the scheduler from 10 mins in line 181, bot.py to see the memory spike sooner.
  5. Try removing a latest notification from your notifications cache (database), to trigger the bot to send a notification to the users, the memory spike is more profound in the case when new notification triggers.

Expected behavior:

The memory useage spikes up with each run of schedule job slowly, and the spike is much more profound when a new notification comes in the KTU website and the bot starts to send users the notification.

Screenshots:

Gradual increase of memory useage:

image


Eventually crossing free tier quota on heroku:

image

Additional context:

Currently the bot is being restarted by a systemd timer every hour in a custom VPS, but this hacky solution can't be used when hosting on hosting providers like heroku.

@AJAYK-01
Copy link
Owner Author

AJAYK-01 commented Mar 23, 2021

Upon further inspection, the memory spike seems to only occur when the scrape() is called in line 22, bot.py

There is no spike when that line is commented out 🤔

@AJAYK-01
Copy link
Owner Author

Then memory spike is less when using html.parser instead of html5lib, but ktu site tends to have broken html tags at times, so it is risky to use html.parser

The issue is caused by the soup object which is not getting destroyed after each scrape.

Tried making soup = None followed by gc.collect() but doesn't seem to have any effect.

@AJAYK-01 AJAYK-01 added bug Something isn't working help wanted Extra attention is needed labels Oct 2, 2021
@AJAYK-01 AJAYK-01 pinned this issue Aug 7, 2022
@AJAYK-01 AJAYK-01 unpinned this issue Oct 30, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant