dbt docs browser tab crashes with ~100K source tables #3026
Comments
Hey @panasenco, thanks for raising this. As you can see in dbt-labs/dbt-docs#170 as well, the docs site is due for a refactor, now that there exist projects orders-of-magnitude bigger than it was initially designed to support a few years ago. I think the prescription here may be more severe than the one in that issue: If a How large is your
Thanks @jtcohen6! The file I'm considering taking a stab at this (currently browsing through the source code). Thinking back to the conversations you had, could you share what refactoring approach you liked the most? What's the most ideal way forward?
Looking at the source code, the quickest hack would be to:
However, this hack wouldn't be pretty and would probably break countless things. I'll just wait for you guys to implement it properly. In the meantime, I'll write a script to generate dbt doc sites for each of my source databases separately as a workaround. Then people can refer to these individual static sites when discovering data, and use a separate site for actual models.
For anyone looking for a workaround, I just wrote the first version of my Python script that generates an index of multiple dbt documentation sites: https://github.com/panasenco/dbt-docs-index
You can now split your dbt documentation into multiple separate sites and use this script to build an index of them.
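To illustrate the general idea of an index over several separately generated docs sites, here is a minimal sketch. It is not the code from the linked repository; the function name `build_index` and the site names and paths in the usage note are hypothetical, and it assumes each site has already been generated into its own subdirectory.

```python
from html import escape
from typing import Dict


def build_index(sites: Dict[str, str], title: str = "dbt documentation index") -> str:
    """Return a static HTML page that links to each docs site.

    `sites` maps a display name to the relative URL of that site's
    index.html (hypothetical layout: one subdirectory per site).
    """
    # One list item per site, sorted by display name for a stable order.
    items = "\n".join(
        f'    <li><a href="{escape(url, quote=True)}">{escape(name)}</a></li>'
        for name, url in sorted(sites.items())
    )
    return (
        "<!DOCTYPE html>\n"
        "<html>\n"
        f"<head><title>{escape(title)}</title></head>\n"
        "<body>\n"
        f"  <h1>{escape(title)}</h1>\n"
        "  <ul>\n"
        f"{items}\n"
        "  </ul>\n"
        "</body>\n"
        "</html>\n"
    )
```

Calling `build_index({"crm": "crm/index.html", "erp": "erp/index.html"})` and writing the result to a top-level `index.html` would give people one landing page from which to reach each smaller site.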
This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.
Describe the bug
dbt docs browser tab crashes when trying to load all ~100K source tables in our data warehouse.
Steps To Reproduce
We have 200 source applications, and some applications have thousands of tables. See the model yml file inside big.zip, which approximately replicates the size of our data warehouse.
After adding the above yml file to the models folder, run dbt docs generate and then dbt docs serve.
Expected behavior
Expected to be able to navigate the docs site.
Screenshots and log output
Firefox:
Chrome:
System information
Which database are you using dbt with?
The output of dbt --version:
The operating system you're using:
Windows 10
The output of python --version:
Python 3.8.3
Additional context
With 40,000 tables, the site eventually loads...
How difficult would it be to add a docs generation mode where each table gets its own html page and we avoid loading the entire manifest.json into browser memory?
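As a rough sketch of the data-splitting half of that idea (not an existing dbt feature), the snippet below writes one small JSON file per source table so that a page would only need to load the entry it displays. It assumes manifest.json has a top-level "sources" mapping whose entries carry "source_name" and "name" keys; the function name `split_sources` and the output layout are hypothetical.

```python
import json
from pathlib import Path


def split_sources(manifest: dict, out_dir: Path) -> int:
    """Write each source table to out_dir/<source_name>/<table>.json.

    Returns the number of files written. Assumes every entry under
    manifest["sources"] has "source_name" and "name" keys.
    """
    count = 0
    for node in manifest.get("sources", {}).values():
        target_dir = out_dir / node["source_name"]
        target_dir.mkdir(parents=True, exist_ok=True)
        # One file per table keeps each page's payload small.
        (target_dir / f"{node['name']}.json").write_text(json.dumps(node))
        count += 1
    return count
```

A per-table page could then fetch only its own file, for example `app1/users.json`, instead of the full manifest. Rendering those pages and handling lineage across files would still need real design work in the docs site itself.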