Class RegexUtils

java.lang.Object
org.apache.tika.utils.RegexUtils

public class RegexUtils extends Object
Inspired from Nutch code class OutlinkExtractor. Apply regex to extract content
  • Constructor Details

    • RegexUtils

      public RegexUtils()
  • Method Details

    • extractLinks

      public static List<String> extractLinks(String content)
      Extract urls from plain text.
      Parameters:
      content - The plain text content to examine
      Returns:
      List of urls within found in the plain text