We're always happy to answer these questions.
Docket entries could be anything, yes: filings, opinions, motions, exhibits, whatever. Items in the Opinions table are only...opinions! And yes, it's true that you can find opinions in the docket-entries table, but they're different because they don't aim to have the same level of quality. They lack HTML, they don't have granular metadata, etc. My advice, if you're working with opinions, is to steer clear of the docket-entries tables if you can.
If you find something in the docket-entries table, I'd ignore it, if you can, and look for a copy of it in the opinions tables.
We get the content from a variety of sources, and when we do, we save the various representations of the text into separate fields. In general, the best field to use is
-
Hi,
I have a question regarding the contents of a document.
My first questions are:
1. If the 'plain_text' field is non-empty for a certain docket entry, is that opinion integrated into the opinions endpoint? (Meaning: am I guaranteed to find that opinion through the opinions endpoint, or is it possible that it will only be available through the docket-entries endpoint?) If so, under which field would the 'plain_text' field from the docket-entries endpoint be available in its opinions endpoint counterpart?
2. The opinions endpoint has several text fields: "plain_text", "html", "html_lawbox", "html_columbia", "html_anon_2020", "xml_harvard", "html_with_citations". Are they mutually exclusive depending on the source of the opinion? If they are not mutually exclusive, do they represent the same information, or can some fields complement each other? If they represent the same information, are there particular fields that I should prioritize over others, given that I seek the most "complete" information? (For example, say a document has both a plain_text and an html field. They represent the same document, but converting from HTML to plain text tends to discard certain information, such as the signatures section, for presentation or technical reasons. Then I would prefer the html field.)
Thank you for your time and help!
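For readers with the same prioritization question, here is a minimal Python sketch of one way to pick a single text representation from an opinion record that has already been fetched as a dictionary. The field names are the ones listed in the question. The priority order, the helper name `pick_text_field`, and the sample record are assumptions made for illustration and are not confirmed anywhere in this thread, so reorder the tuple to match whatever the maintainers recommend.

```python
from typing import Mapping, Optional, Sequence, Tuple

# ASSUMED priority order (not confirmed by the thread): markup-rich
# representations first, plain text last. Reorder as needed.
ASSUMED_PRIORITY: Tuple[str, ...] = (
    "html_with_citations",
    "html_columbia",
    "html_lawbox",
    "xml_harvard",
    "html_anon_2020",
    "html",
    "plain_text",
)


def pick_text_field(
    opinion: Mapping[str, object],
    priority: Sequence[str] = ASSUMED_PRIORITY,
) -> Optional[Tuple[str, str]]:
    """Return (field_name, text) for the first non-empty field, or None.

    `opinion` is a hypothetical dict holding one opinion record.
    """
    for name in priority:
        value = opinion.get(name)
        # Skip missing, null, and whitespace-only fields.
        if isinstance(value, str) and value.strip():
            return name, value
    return None


# Made-up sample record: two populated fields and one empty one.
example = {
    "plain_text": "Signed, J. Doe",
    "html": "<p>Signed, J. Doe</p>",
    "html_lawbox": "",
}
print(pick_text_field(example))
```

With the assumed order, the sample record yields its "html" field, since the fields ranked above it are missing or empty. Passing a different `priority` sequence changes the preference without touching the function.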