A C D F G I N R S T U V W
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- ARC - io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
-
ARC format.
- ArchiveRecordInputFormat - Class in io.archivesunleashed.data
-
Extends FileInputFormat for Web Archive Commons InputFormat.
- ArchiveRecordInputFormat() - Constructor for class io.archivesunleashed.data.ArchiveRecordInputFormat
- ArchiveRecordInputFormat.ArchiveRecordReader - Class in io.archivesunleashed.data
-
Extends RecordReader for Record Reader.
- ArchiveRecordReader() - Constructor for class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
- ArchiveRecordWritable - Class in io.archivesunleashed.data
-
Implements Hadoop Writable for Archive Records.
- ArchiveRecordWritable() - Constructor for class io.archivesunleashed.data.ArchiveRecordWritable
-
Utility function.
- ArchiveRecordWritable(ArchiveRecord) - Constructor for class io.archivesunleashed.data.ArchiveRecordWritable
-
Initialize Archive Record.
- ArchiveRecordWritable.ArchiveFormat - Enum in io.archivesunleashed.data
-
Archive Formats that can be used.
- ArcRecordUtils - Class in io.archivesunleashed.data
-
Utilities for working with
ARCRecord
s (from archive.org APIs).
C
- close() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat
D
- detectFormat() - Method in class io.archivesunleashed.data.ArchiveRecordWritable
-
Detect format of Archive Record.
F
- fromBytes(byte[]) - Static method in class io.archivesunleashed.data.ArcRecordUtils
-
Converts raw bytes into an
ARCRecord
. - fromBytes(byte[]) - Static method in class io.archivesunleashed.data.WarcRecordUtils
-
Converts raw bytes into an
WARCRecord
.
G
- getBodyContent(ARCRecord) - Static method in class io.archivesunleashed.data.ArcRecordUtils
-
Extracts contents of the body from an
ARCRecord
. - getBodyContent(WARCRecord) - Static method in class io.archivesunleashed.data.WarcRecordUtils
-
Extracts contents of the body from a
WARCRecord
. - getContent(ARCRecord) - Static method in class io.archivesunleashed.data.ArcRecordUtils
-
Extracts raw contents from an
ARCRecord
(including HTTP headers). - getContent(WARCRecord) - Static method in class io.archivesunleashed.data.WarcRecordUtils
-
Extracts raw contents from a
WARCRecord
(including HTTP headers). - getCurrentKey() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
- getCurrentValue() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
- getFormat() - Method in class io.archivesunleashed.data.ArchiveRecordWritable
-
Get format of Archive Record.
- getProgress() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
- getRecord() - Method in class io.archivesunleashed.data.ArchiveRecordWritable
-
Get Archive Record.
- getWarcResponseMimeType(byte[]) - Static method in class io.archivesunleashed.data.WarcRecordUtils
-
Extracts the MIME type of WARC response records.
I
- initialize(InputSplit, TaskAttemptContext) - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
- io.archivesunleashed.data - package io.archivesunleashed.data
-
This package provides various data utilities for analyzing WARCs.
- isSplitable(JobContext, Path) - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat
N
- nextKeyValue() - Method in class io.archivesunleashed.data.ArchiveRecordInputFormat.ArchiveRecordReader
R
- readFields(DataInput) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
S
- setFormat(ArchiveRecordWritable.ArchiveFormat) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
-
Set format of Archive Record.
- setRecord(ArchiveRecord) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
-
Set Archive Record.
T
- toBytes(ARCRecord) - Static method in class io.archivesunleashed.data.ArcRecordUtils
-
Converts ARC record into raw bytes.
- toBytes(WARCRecord) - Static method in class io.archivesunleashed.data.WarcRecordUtils
-
Converts WARC record into raw bytes.
U
- UNKNOWN - io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
-
UNKNOWN format.
V
- valueOf(String) - Static method in enum io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
-
Returns an array containing the constants of this enum type, in the order they are declared.
W
- WARC - io.archivesunleashed.data.ArchiveRecordWritable.ArchiveFormat
-
WARC format.
- WarcRecordUtils - Class in io.archivesunleashed.data
-
Utilities for working with
WARCRecord
s (from archive.org APIs). - write(DataOutput) - Method in class io.archivesunleashed.data.ArchiveRecordWritable
All Classes All Packages