Given a scholarly PDF, extract figures, tables, captions, and section titles.
Quality information extraction at web scale. Edit
Science Parse parses scientific papers (in PDF form) and returns them in structured form.
NLP toolkit (tokenizer, POS-tagger, parser, etc.)
SBT Plugins for AI2 projects
Easily identify and label sentence intervals using various taggers.
A collection of useful utility classes and functions.
A scala wrapper for OpenRegex.
Store for immutable objects in S3
Word2Vec Java Port
TuffyLite is an open-source MLN inference engine that modifies the original Tuffy solver.
A corpus retrieval engine based on Apache Lucene