Automated Textbook Indexing with Naive Bayes Classifier Trained on Wikipedia Articles

This is my undergraduate honors thesis and the cumulation of my Computer Science education at North Central College. This project came into existence from a desire to use Wikipedia data as a corpus for Natural Language Processing. Since indexing textbooks is an expensive problem, it made sense to attempt to use the data for social good.

To download a copy, check out the "releases" tab on the top of the GitHub project page and select the latest version. I hope you find as much value in reading it as I did in writing it.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
code		code
data		data
tex		tex
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
document.tex		document.tex

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automated Textbook Indexing with Naive Bayes Classifier Trained on Wikipedia Articles

About

Releases 5

Packages

Languages

mikeholler/thesis-undergrad

Folders and files

Latest commit

History

Repository files navigation

Automated Textbook Indexing with Naive Bayes Classifier Trained on Wikipedia Articles

About

Resources

Stars

Watchers

Forks

Releases 5

Packages 0

Languages

Packages