Welcome to the SharpNLP Project
SharpNLP is a collection of natural language processing tools written in C#. Currently it provides the following NLP tools:
- a sentence splitter
- a tokenizer
- a part-of-speech tagger
- a chunker (used to "find non-recursive syntactic annotations such as noun phrase chunks")
- a parser
- a name finder
- a coreference tool
- an interface to the WordNet lexical database
SharpNLP 1.0.2529 BetaThis is the latest release of SharpNLP. To use it you will need to download either the runtime binaries or the source code (
SharpNLP 1.0.2529 Beta), plus the model files (
NBin Models 1.0.0 Production). Detailed instructions can be found in the
Installation Instructions.
What is SharpNLPOpenNLP is both the name of a group of open source projects (
http://opennlp.sourceforge.net) related to natural language processing (NLP), and the name of a library of NLP tools written in Java (
http://sourceforge.net/projects/opennlp) by Jason Baldridge, Tom Morton, and Gann Bierner. SharpNLP is a C# port of the Java OpenNLP tools, plus additional code to facilitate natural language processing.
All of these tools are driven by maximum entropy models processed by the SharpEntropy library.
In addition, SharpNLP contains a WordNet database access library called SharpWordNet.
How can I help?Developer GuideMore information can be obtained from these CodeProject articles:
http://www.codeproject.com/csharp/englishparsing.asphttp://www.codeproject.com/cs/library/sharpentropy.asp