Package io.archivesunleashed.data
Class ArchiveRecordInputFormat
- java.lang.Object
-
- org.apache.hadoop.mapreduce.InputFormat<K,V>
-
- org.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,ArchiveRecordWritable>
-
- io.archivesunleashed.data.ArchiveRecordInputFormat
-
public class ArchiveRecordInputFormat extends org.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,ArchiveRecordWritable>
Extends FileInputFormat for Web Archive Commons InputFormat.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description class
ArchiveRecordInputFormat.ArchiveRecordReader
Extends RecordReader for Record Reader.
-
Constructor Summary
Constructors Constructor Description ArchiveRecordInputFormat()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,ArchiveRecordWritable>
createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)
protected boolean
isSplitable(org.apache.hadoop.mapreduce.JobContext context, org.apache.hadoop.fs.Path filename)
-
Methods inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat
addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getFormatMinSplitSize, getInputDirRecursive, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, getSplits, listStatus, makeSplit, makeSplit, setInputDirRecursive, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
-
-
-
-
Constructor Detail
-
ArchiveRecordInputFormat
public ArchiveRecordInputFormat()
-
-
Method Detail
-
createRecordReader
public final org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,ArchiveRecordWritable> createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context) throws IOException, InterruptedException
- Specified by:
createRecordReader
in classorg.apache.hadoop.mapreduce.InputFormat<org.apache.hadoop.io.LongWritable,ArchiveRecordWritable>
- Throws:
IOException
InterruptedException
-
isSplitable
protected final boolean isSplitable(org.apache.hadoop.mapreduce.JobContext context, org.apache.hadoop.fs.Path filename)
- Overrides:
isSplitable
in classorg.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,ArchiveRecordWritable>
-
-