High Performance NLP with Apache Spark

John Snow Labs’ NLP is a text processing library built on top of Apache Spark and its Spark ML library. It's goal is to provide easy API for NLP annotations allowing a scalable approach within a distributed large scale environment.

Questions? Join our Slack

2018 May 2nd - Update! 1.5.3 Released! Model downloader now uses distributed filesystems, new spell checker and better assertion status. Plus bugfixes! Learn more HERE and check out updated documentation below

Get started

Quick start guide to setup spark-nlp and get going

Documentation

Pretrained models, pipelines and other concepts reference

Examples

Sample Notebooks, guideline to use SparkNLP

Contribute

Ways to Contribute to spark-nlp repository

Resources & FAQs

Videos, Podcasts, Whitepapers and other questions

License & Credits

Licensing / Acknowledgements