-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
new OBO loader wont accept KEGG OBO. #18
Comments
simply adding
oh dear.
We defined terms that we didnt want to "take seriously" because they dont have kegg accessions like the above one. But now we're going to fail the EBI OLS lookup for each. @spficklin this is actually a big problem. We need an "override" option for the importer that wont look up terms and will just blindly insert. I dontk now that we have an alternative. There are literally no KEGG terms in EBI to look up, and because some terms use redundant names/accessions, we HAVE to define new DBs for them or it wont load properly. |
additional idea: |
background info to help:
and see also : #7 |
@spficklin has brought up using local terms instead. This seems like a good, direct solution.
Local DB, but KEGG CV i guess? I'd have to think about that... |
The previous version of the KEGG module created three different databases: As a second thought.... each of the leaf terms seem to have a properly formed ID already created by KEGG. For example: For example, the OBO right now converts this term: KO:K03022 to KEGG:K03022 I would suggest it just be left as KO:K03022 in the OBO file. Oh, and I think the OBO may have other issues too. For example. This term:
This is for an ortholog (gene). and the relationship with pathways is I think the term stanza should look like this instead:
|
So after talking with @bradfordcondon a bit on Slack it seems the desire is to have all of the KEGG vocabularies appear in a single tree like the Gene Ontology vocabularies do. This is why However, The problem here for putting them all in the same "tree" for viewing is that KEGG uses a different prefix ID for each vocabulary similar to the EDAM vocabulary. Chado doesn't support this and for EDAM we've had to reverse the CV and DB records to make this work and we've always programmatically loaded EDAM terms. So, while I still think we should setup the OBO to use the correct terms (created by KEGG) we would need to adjust the OBO loader to somehow "reverse" the CV and DBs when storing in Chado. This wouldn't be unprecedented because we do it for EDAM. |
Also, for backwards compatibility we should do something with the old Tv2 vocabularies created by this module as they are in use on some sites I think this module, in the install, once we settle on how the actual CV/DB records will be should rename those old CV/DB records to use the same names. Then, the OBO loader should update old terms when it's loaded. |
Perhaps working out the solution to this issue for Chado will resolve the problem for us: GMOD/Chado#58 |
to add to this, the new KEGG annotations being returned have new terms that weren't in the previous OBO, forcing us to ignore them or fix this problem.... |
Hi, I have tried to upload kegg obo using tripal v3.2, and the same error occured as @bradfordcondon described . Is this problem solved so far? Thank you |
I’m sorry to report it is not.
@spficklin has your thinking changed on this? (i know you are out of the office right now ill remind later) I think its realistic for us to fix the OBO builder to create the required fields at the top of the file. Overhauling the OBO to properly use the correct CVs/DBs is harder and I dont know that it will ever move into our crosshairs... I think its not as simple as just swapping teh CVS/DBs because of redundant terms...
As of right now this module is more or less nonfunctional without a fix... perhaps we should indicate that in the docs/on the README here if we cant address soon.
|
We'll be taking a look at this again. From what I can tell from all the discussion here and in other related issues, there are three main possible routes:
There was also some movement on modifying Chado to better accept KEGG (as well as EDAM) I'm looking to revive the discussion here. Is there a preferred direction? |
error:
Probably some OBO header info that isnt printed out
The text was updated successfully, but these errors were encountered: