package matchbox
Package object which supplies implicits providing common UDF-related functionalities.
- Alphabetic
- By Inheritance
- matchbox
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Type Members
- 
      
      
      
        
      
    
      
        
        class
      
      
        ImageDetails extends AnyRef
      
      
      Information about an image. Information about an image. e.g. width, height. 
-  implicit class WWWLink extends AnyRef
Value Members
- 
      
      
      
        
      
    
      
        
        object
      
      
        ComputeImageSize
      
      
      Image sizing utilities. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ComputeMD5
      
      
      Compute MD5 checksum. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ComputeSHA1
      
      
      Compute SHA1 checksum. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        DetectLanguage
      
      
      Detects language using Apache Tika. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        DetectMimeTypeTika
      
      
      Detect MIME type using Apache Tika. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractBoilerpipeText
      
      
      Extract raw text content from an HTML page, minus "boilerplate" content (using boilerpipe). 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractDate
      
      
      Gets different parts of a dateString. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractDomain
      
      
      Extracts the host domain name from a full url string. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractImageDetails
      
      
      Extracts image details given raw bytes. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractImageLinks
      
      
      Extracts image links from a webpage given the HTML content (using Jsoup). 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractLinks
      
      
      Extracts links from a webpage given the HTML content (using Jsoup). 
- 
      
      
      
        
      
    
      
        
        object
      
      
        ExtractTextFromPDFs
      
      
      Exacts texts from PDFs using Apache Tika. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        GetExtensionMIME
      
      
      Get file extension using MIME type, then URL extension. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        RemoveHTML
      
      
      Removes HTML markup with JSoup. 
- 
      
      
      
        
      
    
      
        
        object
      
      
        RemoveHTTPHeader
      
      
      Remove HTTP headers.