Package org.apache.flink.formats.parquet
Class ParquetVectorizedInputFormat<T,SplitT extends org.apache.flink.connector.file.src.FileSourceSplit>
- java.lang.Object
-
- org.apache.flink.formats.parquet.ParquetVectorizedInputFormat<T,SplitT>
-
- All Implemented Interfaces:
Serializable,org.apache.flink.api.java.typeutils.ResultTypeQueryable<T>,org.apache.flink.connector.file.src.reader.BulkFormat<T,SplitT>
- Direct Known Subclasses:
ParquetColumnarRowInputFormat
public abstract class ParquetVectorizedInputFormat<T,SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> extends Object implements org.apache.flink.connector.file.src.reader.BulkFormat<T,SplitT>
ParquetBulkFormatthat reads data from the file toVectorizedColumnBatchin vectorized mode.- See Also:
- Serialized Form
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description protected static classParquetVectorizedInputFormat.ParquetReaderBatch<T>Reader batch that provides writing and reading capabilities.
-
Field Summary
Fields Modifier and Type Field Description protected SerializableConfigurationhadoopConfigprotected booleanisUtcTimestamp
-
Constructor Summary
Constructors Constructor Description ParquetVectorizedInputFormat(SerializableConfiguration hadoopConfig, org.apache.flink.table.types.logical.RowType projectedType, ColumnBatchFactory<SplitT> batchFactory, int batchSize, boolean isUtcTimestamp, boolean isCaseSensitive)
-
Method Summary
All Methods Instance Methods Abstract Methods Concrete Methods Modifier and Type Method Description org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReadercreateReader(org.apache.flink.configuration.Configuration config, SplitT split)protected abstract ParquetVectorizedInputFormat.ParquetReaderBatch<T>createReaderBatch(org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector[] writableVectors, org.apache.flink.table.data.columnar.vector.VectorizedColumnBatch columnarBatch, org.apache.flink.connector.file.src.util.Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<T>> recycler)booleanisSplittable()protected intnumBatchesToCirculate(org.apache.flink.configuration.Configuration config)org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderrestoreReader(org.apache.flink.configuration.Configuration config, SplitT split)
-
-
-
Field Detail
-
hadoopConfig
protected final SerializableConfiguration hadoopConfig
-
isUtcTimestamp
protected final boolean isUtcTimestamp
-
-
Constructor Detail
-
ParquetVectorizedInputFormat
public ParquetVectorizedInputFormat(SerializableConfiguration hadoopConfig, org.apache.flink.table.types.logical.RowType projectedType, ColumnBatchFactory<SplitT> batchFactory, int batchSize, boolean isUtcTimestamp, boolean isCaseSensitive)
-
-
Method Detail
-
createReader
public org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReader createReader(org.apache.flink.configuration.Configuration config, SplitT split) throws IOException- Specified by:
createReaderin interfaceorg.apache.flink.connector.file.src.reader.BulkFormat<T,SplitT extends org.apache.flink.connector.file.src.FileSourceSplit>- Throws:
IOException
-
numBatchesToCirculate
protected int numBatchesToCirculate(org.apache.flink.configuration.Configuration config)
-
restoreReader
public org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReader restoreReader(org.apache.flink.configuration.Configuration config, SplitT split) throws IOException- Specified by:
restoreReaderin interfaceorg.apache.flink.connector.file.src.reader.BulkFormat<T,SplitT extends org.apache.flink.connector.file.src.FileSourceSplit>- Throws:
IOException
-
isSplittable
public boolean isSplittable()
-
createReaderBatch
protected abstract ParquetVectorizedInputFormat.ParquetReaderBatch<T> createReaderBatch(org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector[] writableVectors, org.apache.flink.table.data.columnar.vector.VectorizedColumnBatch columnarBatch, org.apache.flink.connector.file.src.util.Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<T>> recycler)
- Parameters:
writableVectors- vectors to be writecolumnarBatch- vectors to be readrecycler- batch recycler
-
-