Packages

package matchbox

Package object which supplies implicits providing common UDF-related functionalities.

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. matchbox
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. class ImageDetails extends AnyRef

    Information about an image.

    Information about an image. e.g. width, height.

  2. implicit class WWWLink extends AnyRef

Value Members

  1. object ComputeImageSize

    Image sizing utilities.

  2. object ComputeMD5

    Compute MD5 checksum.

  3. object ComputeSHA1

    Compute SHA1 checksum.

  4. object CovertLastModifiedDate

    Converts RFC 1123 dates to yyyyMMddHHmmss.

  5. object DetectLanguage

    Detects language using Apache Tika.

  6. object DetectMimeTypeTika

    Detect MIME type using Apache Tika.

  7. object ExtractBoilerpipeText

    Extract raw text content from an HTML page, minus "boilerplate" content (using boilerpipe).

  8. object ExtractDate

    Gets different parts of a dateString.

  9. object ExtractDomain

    Extracts the host domain name from a full url string.

  10. object ExtractImageDetails

    Extracts image details given raw bytes.

  11. object ExtractImageLinks

    Extracts image links from a webpage given the HTML content (using Jsoup).

  12. object ExtractLinks

    Extracts links from a webpage given the HTML content (using Jsoup).

  13. object ExtractTextFromPDFs

    Exacts texts from PDFs using Apache Tika.

  14. object GetExtensionMIME

    Get file extension using MIME type, then URL extension.

  15. object RemoveHTML

    Removes HTML markup with JSoup.

  16. object RemoveHTTPHeader

    Remove HTTP headers.

Inherited from AnyRef

Inherited from Any

Ungrouped