Interface SentenceTokenizer

All Superinterfaces:
Tokenizer
All Known Implementing Classes:
SimpleSentenceTokenizer, SRXSentenceTokenizer

public interface SentenceTokenizer extends Tokenizer
Tokenizes text into sentences.
  • Method Details

    • tokenize

      List<String> tokenize(String text)
      Tokenize the given string to sentences.
      Specified by:
      tokenize in interface Tokenizer
    • setSingleLineBreaksMarksParagraph

      void setSingleLineBreaksMarksParagraph(boolean lineBreakParagraphs)
      Parameters:
      lineBreakParagraphs - if true, single line breaks are assumed to end a paragraph, with false, only two ore more consecutive line breaks end a paragraph
    • singleLineBreaksMarksPara

      boolean singleLineBreaksMarksPara()