Package org.apache.flink.formats.parquet.vector.reader

Interface Summary

Interface and Description

ColumnReader<VECTOR extends org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector>
    Read a batch of records for a column to WritableColumnVector from parquet data file.

ParquetDataColumnReader
    The interface to wrap the underlying Parquet dictionary and non dictionary encoded page reader.

Class Summary

Class and Description

AbstractColumnReader<VECTOR extends org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector>
    Abstract ColumnReader.

ArrayColumnReader
    Array ColumnReader.

BaseVectorizedColumnReader
    A column-level Parquet reader used to read a batch of records for a column; part of the code is derived from Apache Hive and Apache Parquet.

BaseVectorizedColumnReader.NullIntIterator
    Always reads zero.

BaseVectorizedColumnReader.RLEIntIterator
    Reads int from RunLengthBitPackingHybridDecoder.

BaseVectorizedColumnReader.ValuesReaderIntIterator
    Reads int from ValuesReader.

BooleanColumnReader
    Boolean ColumnReader.

ByteColumnReader
    Byte ColumnReader.

BytesColumnReader
    Bytes ColumnReader.

DoubleColumnReader
    Double ColumnReader.

FixedLenBytesColumnReader<VECTOR extends org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector>
    Fixed length bytes ColumnReader, just for decimal.

FloatColumnReader
    Float ColumnReader.

IntColumnReader
    Int ColumnReader.

LongColumnReader
    Long ColumnReader.

MapColumnReader
    Map ColumnReader.

ParquetDataColumnReaderFactory
    Parquet file has self-describing schema which may differ from the user required schema (e.g.

ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
    The default data column reader for the existing Parquet page reader, which works for both dictionary and non-dictionary types; mirrored from the dictionary encoding path.

ParquetDataColumnReaderFactory.TypesFromInt96PageReader
    The reader that reads from the underlying Timestamp value.

RowColumnReader
    Row ColumnReader.

ShortColumnReader
    Short ColumnReader.

TimestampColumnReader
    Timestamp ColumnReader.
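
The three int iterators nested in BaseVectorizedColumnReader suggest a common pattern: one small "next int" abstraction, with one implementation that always yields zero and others that delegate to an underlying decoder. The sketch below illustrates that pattern only. It is not the Flink source: the names IntSource, AlwaysZeroSource, DelegatingSource and countDefined are hypothetical, and the decoder is stood in for by a plain java.util iterator rather than a Parquet ValuesReader or RunLengthBitPackingHybridDecoder. It also assumes, as is usual for Parquet level decoding, that a value is present when its definition level equals the column's maximum definition level.

```java
import java.util.PrimitiveIterator;
import java.util.stream.IntStream;

// Hypothetical stand-in for an int-iterator abstraction; not the Flink API.
interface IntSource {
    int nextInt();
}

// Analogous in spirit to an iterator that "always reads zero":
// useful when a column has no levels to decode (assumption).
final class AlwaysZeroSource implements IntSource {
    @Override
    public int nextInt() {
        return 0;
    }
}

// Analogous in spirit to an iterator that wraps a decoder.
// Here the "decoder" is just a java.util primitive iterator.
final class DelegatingSource implements IntSource {
    private final PrimitiveIterator.OfInt decoder;

    DelegatingSource(PrimitiveIterator.OfInt decoder) {
        this.decoder = decoder;
    }

    @Override
    public int nextInt() {
        return decoder.nextInt();
    }
}

final class LevelSourceDemo {
    // Counts how many of the next `count` entries are defined, i.e. have a
    // definition level equal to the maximum definition level.
    static int countDefined(IntSource levels, int count, int maxDefLevel) {
        int defined = 0;
        for (int i = 0; i < count; i++) {
            if (levels.nextInt() == maxDefLevel) {
                defined++;
            }
        }
        return defined;
    }

    public static void main(String[] args) {
        // Max level 0: every entry is defined, so no decoding is needed.
        System.out.println(countDefined(new AlwaysZeroSource(), 4, 0));

        // Max level 1: levels come from a decoder; a 0 marks a null entry.
        IntSource decoded = new DelegatingSource(IntStream.of(1, 0, 1, 1).iterator());
        System.out.println(countDefined(decoded, 4, 1));
    }
}
```

Keeping the caller behind a single interface means the batch-reading loop does not need to know which encoding, if any, produced the levels.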