package matchbox
Package object which supplies implicits providing common UDF-related functionalities.
- Alphabetic
- By Inheritance
- matchbox
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Type Members
-
class
ImageDetails extends AnyRef
Information about an image.
Information about an image. e.g. width, height.
- implicit class WWWLink extends AnyRef
Value Members
-
object
ComputeImageSize
Image sizing utilities.
-
object
ComputeMD5
Compute MD5 checksum.
-
object
ComputeSHA1
Compute SHA1 checksum.
-
object
CovertLastModifiedDate
Converts RFC 1123 dates to yyyyMMddHHmmss.
-
object
DetectLanguage
Detects language using Apache Tika.
-
object
DetectMimeTypeTika
Detect MIME type using Apache Tika.
-
object
ExtractBoilerpipeText
Extract raw text content from an HTML page, minus "boilerplate" content (using boilerpipe).
-
object
ExtractDate
Gets different parts of a dateString.
-
object
ExtractDomain
Extracts the host domain name from a full url string.
-
object
ExtractImageDetails
Extracts image details given raw bytes.
-
object
ExtractImageLinks
Extracts image links from a webpage given the HTML content (using Jsoup).
-
object
ExtractLinks
Extracts links from a webpage given the HTML content (using Jsoup).
-
object
ExtractTextFromPDFs
Exacts texts from PDFs using Apache Tika.
-
object
GetExtensionMIME
Get file extension using MIME type, then URL extension.
-
object
RemoveHTML
Removes HTML markup with JSoup.
-
object
RemoveHTTPHeader
Remove HTTP headers.