Package opennlp.tools.sentdetect
package opennlp.tools.sentdetect
Package related to identifying sentence boundaries.
-
ClassDescriptionDefault implementation of the
EndOfSentenceScanner.Generate event contexts for maxent decisions for sentence detection.ObjectStreamto clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.The NewlineSentenceDetectorassumes that sentences are line delimited and recognizes one sentence per non-empty line.Interface forSentenceDetectorMEcontext generators.A cross validator forsentence detectors.The interface for sentence detectors, which find the sentence boundaries in a text.TheSentenceDetectorEvaluatormeasures the performance of the givenSentenceDetectorwith the provided referenceSentenceSamples.The factory that providesSentenceDetectordefault implementations and resourcesA sentence detector for splitting up raw text into sentences.TheSentenceModelis the model used by a learnableSentenceDetector.ASentenceSamplecontains a document with begin indexes of the individual sentences.This class is a stream filter which reads a sentence by line samples from anObjectStreamand converts them intoSentenceSampleobjects.A thread-safe version ofSentenceDetectorME.