A C D F G I N R S T U V W 
All Classes All Packages

A

ARC - io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
ARC format.
ArchiveRecordInputFormat - Class in io.archivesunleashed.data
Extends FileInputFormat to provide an InputFormat for Web Archive Commons records.
ArchiveRecordInputFormat() - Constructor for class io.archivesunleashed.data.ArchiveRecordInputFormat
 
ArchiveRecordInputFormat.ArchiveRecordReader - Class in io.archivesunleashed.data
Extends RecordReader to read archive records.
ArchiveRecordReader() - Constructor for class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 
ArchiveRecordWritable - Class in io.archivesunleashed.data
Implements Hadoop Writable for Archive Records.
ArchiveRecordWritable() - Constructor for class io.archivesunleashed.data.ArchiveRecordWritable
No-argument utility constructor.
ArchiveRecordWritable(ArchiveRecord) - Constructor for class io.archivesunleashed.data.ArchiveRecordWritable
Initialize Archive Record.
ArchiveRecordWritable.ArchiveFormat - Enum in io.archivesunleashed.data
Archive Formats that can be used.
ArcRecordUtils - Class in io.archivesunleashed.data
Utilities for working with ARCRecords (from archive.org APIs).

C

close() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat
 

D

detectFormat() - Method in class io.archivesunleashed.data.ArchiveRecordWritable
Detect format of Archive Record.

F

fromBytes(byte[]) - Static method in class io.archivesunleashed.data.ArcRecordUtils
Converts raw bytes into an ARCRecord.
fromBytes(byte[]) - Static method in class io.archivesunleashed.data.WarcRecordUtils
Converts raw bytes into a WARCRecord.

G

getBodyContent(ARCRecord) - Static method in class io.archivesunleashed.data.ArcRecordUtils
Extracts contents of the body from an ARCRecord.
getBodyContent(WARCRecord) - Static method in class io.archivesunleashed.data.WarcRecordUtils
Extracts contents of the body from a WARCRecord.
getContent(ARCRecord) - Static method in class io.archivesunleashed.data.ArcRecordUtils
Extracts raw contents from an ARCRecord (including HTTP headers).
getContent(WARCRecord) - Static method in class io.archivesunleashed.data.WarcRecordUtils
Extracts raw contents from a WARCRecord (including HTTP headers).
getCurrentKey() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 
getCurrentValue() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 
getFormat() - Method in class io.archivesunleashed.data.ArchiveRecordWritable
Get format of Archive Record.
getProgress() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 
getRecord() - Method in class io.archivesunleashed.data.ArchiveRecordWritable
Get Archive Record.
getWarcResponseMimeType(byte[]) - Static method in class io.archivesunleashed.data.WarcRecordUtils
Extracts the MIME type of WARC response records.
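The entries above distinguish getContent (raw contents, HTTP headers included) from getBodyContent (body only), and getWarcResponseMimeType reports the MIME type of a response. The sketch below illustrates that distinction in plain Java. It is not the library's implementation: the class HttpPayloadSketch and its methods are hypothetical, and it assumes the payload is an HTTP response whose header block ends at the first CRLF CRLF sequence.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Locale;

// Hypothetical helper for illustration; not part of io.archivesunleashed.data.
class HttpPayloadSketch {

    // Index of the first byte after the CRLF CRLF header terminator, or -1 if absent.
    static int bodyOffset(byte[] raw) {
        for (int i = 0; i + 3 < raw.length; i++) {
            if (raw[i] == '\r' && raw[i + 1] == '\n'
                    && raw[i + 2] == '\r' && raw[i + 3] == '\n') {
                return i + 4;
            }
        }
        return -1;
    }

    // Body only: everything after the header block (empty if no terminator is found).
    static byte[] body(byte[] raw) {
        int offset = bodyOffset(raw);
        return offset < 0 ? new byte[0] : Arrays.copyOfRange(raw, offset, raw.length);
    }

    // MIME type from the Content-Type header, without parameters such as charset.
    // Returns null when no Content-Type header is present.
    static String mimeType(byte[] raw) {
        int offset = bodyOffset(raw);
        int end = offset < 0 ? raw.length : offset;
        String headers = new String(raw, 0, end, StandardCharsets.ISO_8859_1);
        for (String line : headers.split("\r\n")) {
            int colon = line.indexOf(':');
            if (colon > 0 && line.substring(0, colon).trim().equalsIgnoreCase("Content-Type")) {
                String value = line.substring(colon + 1).trim();
                int semi = value.indexOf(';');
                String type = semi >= 0 ? value.substring(0, semi) : value;
                return type.trim().toLowerCase(Locale.ROOT);
            }
        }
        return null;
    }

    public static void main(String[] args) {
        byte[] raw = "HTTP/1.1 200 OK\r\nContent-Type: text/html; charset=UTF-8\r\n\r\n<html></html>"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(mimeType(raw));
        System.out.println(new String(body(raw), StandardCharsets.ISO_8859_1));
    }
}
```

In this sketch the raw byte array plays the role of what getContent returns, and body(raw) plays the role of getBodyContent.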

I

initialize(InputSplit, TaskAttemptContext) - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 
io.archivesunleashed.data - package io.archivesunleashed.data
This package provides data utilities for reading and analyzing ARC and WARC files.
isSplitable(JobContext, Path) - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat
 

N

nextKeyValue() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
 

R

readFields(DataInput) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
 

S

setFormat(ArchiveRecordWritable.ArchiveFormat) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
Set format of Archive Record.
setRecord(ArchiveRecord) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
Set Archive Record.
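ArchiveRecordWritable also implements Hadoop's Writable contract through write(DataOutput) and readFields(DataInput), so the format and record it holds must survive a serialize/deserialize round trip. The sketch below shows that contract using only java.io. The class RecordWritableSketch, its fields, and the wire layout (format ordinal, payload length, payload bytes) are assumptions made for illustration, not the library's actual encoding.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Arrays;

// Hypothetical stand-in; the real ArchiveRecordWritable wraps an ArchiveRecord.
class RecordWritableSketch {
    // Constant names taken from this index; declaration order is assumed.
    enum Format { ARC, WARC, UNKNOWN }

    Format format = Format.UNKNOWN;
    byte[] payload = new byte[0];

    // Assumed layout: format ordinal, payload length, payload bytes.
    void write(DataOutput out) throws IOException {
        out.writeInt(format.ordinal());
        out.writeInt(payload.length);
        out.write(payload);
    }

    // Reads the fields in exactly the order write() emitted them.
    void readFields(DataInput in) throws IOException {
        format = Format.values()[in.readInt()];
        payload = new byte[in.readInt()];
        in.readFully(payload);
    }

    // Serializes to an in-memory buffer and deserializes into a fresh instance.
    static RecordWritableSketch roundTrip(RecordWritableSketch original) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            original.write(new DataOutputStream(buffer));
            RecordWritableSketch copy = new RecordWritableSketch();
            copy.readFields(new DataInputStream(new ByteArrayInputStream(buffer.toByteArray())));
            return copy;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        RecordWritableSketch original = new RecordWritableSketch();
        original.format = Format.WARC;
        original.payload = new byte[] {1, 2, 3};
        RecordWritableSketch copy = roundTrip(original);
        System.out.println(copy.format + " " + Arrays.toString(copy.payload)); // prints "WARC [1, 2, 3]"
    }
}
```

Writing the length before the payload lets readFields allocate the buffer up front and read it with a single readFully call.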

T

toBytes(ARCRecord) - Static method in class io.archivesunleashed.data.ArcRecordUtils
Converts an ARCRecord into raw bytes.
toBytes(WARCRecord) - Static method in class io.archivesunleashed.data.WarcRecordUtils
Converts a WARCRecord into raw bytes.

U

UNKNOWN - io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
UNKNOWN format.

V

valueOf(String) - Static method in enum io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
Returns the enum constant of this type with the specified name.
values() - Static method in enum io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
Returns an array containing the constants of this enum type, in the order they are declared.
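valueOf(String) and values() are the standard methods every Java enum receives. The sketch below uses a stand-in enum with the three constant names listed in this index (ARC, WARC, UNKNOWN); the declaration order and the lenient parse helper are assumptions for illustration and are not part of the library.

```java
import java.util.Arrays;
import java.util.Locale;

class ArchiveFormatSketch {
    // Stand-in for ArchiveRecordWritable.ArchiveFormat; declaration order is assumed.
    enum ArchiveFormat { ARC, WARC, UNKNOWN }

    // Hypothetical lenient lookup: valueOf throws IllegalArgumentException for
    // names that match no constant, so fall back to UNKNOWN instead.
    static ArchiveFormat parse(String name) {
        if (name == null) {
            return ArchiveFormat.UNKNOWN;
        }
        try {
            return ArchiveFormat.valueOf(name.trim().toUpperCase(Locale.ROOT));
        } catch (IllegalArgumentException e) {
            return ArchiveFormat.UNKNOWN;
        }
    }

    public static void main(String[] args) {
        // values() returns the constants in declaration order.
        System.out.println(Arrays.toString(ArchiveFormat.values()));
        System.out.println(parse("warc"));
        System.out.println(parse("zip"));
    }
}
```

Note that valueOf is case-sensitive and exact, which is why the helper normalizes its input before the lookup.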

W

WARC - io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
WARC format.
WarcRecordUtils - Class in io.archivesunleashed.data
Utilities for working with WARCRecords (from archive.org APIs).
write(DataOutput) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
 