Class ParquetDataColumnReaderFactory
java.lang.Object
  org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory
public final class ParquetDataColumnReaderFactory extends Object
A Parquet file carries a self-describing schema, which may differ from the schema the user requests (for example, after schema evolution). This factory returns a reader that reads the underlying data and exposes it as the type the user requested. Parts of the code are adapted from Apache Hive.
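The idea described above can be illustrated with a minimal, self-contained sketch. It is not the Flink implementation: StoredType, TypedReader, RawValues and readerFor are hypothetical names invented for this example, and only an INT32-to-long widening case is shown. The sketch assumes a file that stored a column as 32-bit integers while the caller now asks for long values.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical sketch: none of these names belong to the Flink or Parquet API.
public class TypedReaderSketch {

    /** How the file physically stores the column (illustrative subset). */
    enum StoredType { INT32, INT64 }

    /** Typed view the caller reads through, whatever the stored type is. */
    interface TypedReader {
        long readLong();
    }

    /** Stand-in for the underlying page reader: hands out raw stored values. */
    static final class RawValues {
        private final Iterator<Number> values;

        RawValues(List<Number> values) {
            this.values = values.iterator();
        }

        int nextInt() {
            return values.next().intValue();
        }

        long nextLong() {
            return values.next().longValue();
        }
    }

    /** Picks a reader based on the stored type; the caller always gets a long. */
    static TypedReader readerFor(StoredType stored, RawValues raw) {
        switch (stored) {
            case INT32:
                // The file wrote 32-bit ints; widen each value to the requested long.
                return () -> (long) raw.nextInt();
            case INT64:
                // Stored and requested types already match.
                return () -> raw.nextLong();
            default:
                throw new IllegalArgumentException("Unsupported stored type: " + stored);
        }
    }

    public static void main(String[] args) {
        // An "old" file whose column was written as INT32.
        RawValues oldFile = new RawValues(Arrays.<Number>asList(1, 2));
        TypedReader reader = readerFor(StoredType.INT32, oldFile);
        System.out.println(reader.readLong() + reader.readLong());
    }
}
```

The caller never inspects the stored type. It asks the factory method for a reader once and then reads through the typed interface, which is the role the two static methods of this class appear to play for real Parquet columns.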
Nested Class Summary
Nested Classes (Modifier and Type, Class, Description)
- static class ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader: The default data column reader for an existing Parquet page reader. It works for both dictionary and non-dictionary types and mirrors the dictionary encoding path.
- static class ParquetDataColumnReaderFactory.TypesFromInt96PageReader: The reader that reads from an underlying Timestamp value.
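As background for TypesFromInt96PageReader, the sketch below decodes a timestamp from the INT96 layout conventionally written by Hive, Impala and Spark: 12 little-endian bytes holding 8 bytes of nanoseconds within the day followed by a 4-byte Julian day number. The class and method names are hypothetical, and the sketch assumes the nested reader follows this convention. It does not show any time zone handling tied to the isUtcTimestamp flag.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical sketch of INT96 timestamp decoding; not the Flink implementation.
public class Int96TimestampSketch {

    /** Julian day number of 1970-01-01, the Unix epoch. */
    private static final long JULIAN_DAY_OF_EPOCH = 2_440_588L;
    private static final long MILLIS_PER_DAY = 86_400_000L;
    private static final long NANOS_PER_MILLI = 1_000_000L;

    /** Converts a 12-byte INT96 value to milliseconds since the Unix epoch. */
    static long int96ToEpochMillis(byte[] int96) {
        if (int96.length != 12) {
            throw new IllegalArgumentException("INT96 must be 12 bytes, got " + int96.length);
        }
        ByteBuffer buf = ByteBuffer.wrap(int96).order(ByteOrder.LITTLE_ENDIAN);
        long nanosOfDay = buf.getLong(); // first 8 bytes: nanoseconds within the day
        int julianDay = buf.getInt();    // last 4 bytes: Julian day number
        return (julianDay - JULIAN_DAY_OF_EPOCH) * MILLIS_PER_DAY
                + nanosOfDay / NANOS_PER_MILLI;
    }

    /** Test helper: builds an INT96 value in the same layout. */
    static byte[] encode(long nanosOfDay, int julianDay) {
        return ByteBuffer.allocate(12)
                .order(ByteOrder.LITTLE_ENDIAN)
                .putLong(nanosOfDay)
                .putInt(julianDay)
                .array();
    }

    public static void main(String[] args) {
        // 1.5 seconds into the day after the epoch.
        byte[] value = encode(1_500_000_000L, 2_440_589);
        System.out.println(int96ToEpochMillis(value));
    }
}
```

Sub-millisecond precision is dropped here for brevity. A reader that returns a full timestamp type would keep the remaining nanoseconds instead.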
Method Summary
Static Methods (Modifier and Type, Method)
- static ParquetDataColumnReader getDataColumnReaderByType(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.values.ValuesReader realReader, boolean isUtcTimestamp)
- static ParquetDataColumnReader getDataColumnReaderByTypeOnDictionary(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.Dictionary realReader, boolean isUtcTimestamp)
Method Detail
getDataColumnReaderByTypeOnDictionary
public static ParquetDataColumnReader getDataColumnReaderByTypeOnDictionary(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.Dictionary realReader, boolean isUtcTimestamp)
getDataColumnReaderByType
public static ParquetDataColumnReader getDataColumnReaderByType(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.values.ValuesReader realReader, boolean isUtcTimestamp)