Class ParquetDataColumnReaderFactory


  • public final class ParquetDataColumnReaderFactory
    extends Object
    A Parquet file has a self-describing schema, which may differ from the schema the user requires (for example, after schema evolution). This factory returns a reader that reads the underlying data and exposes it as the type the user requires. Parts of the code are adapted from Apache Hive.
    • Method Detail

      • getDataColumnReaderByTypeOnDictionary

        public static ParquetDataColumnReader getDataColumnReaderByTypeOnDictionary(org.apache.parquet.schema.PrimitiveType parquetType,
                                                                                    org.apache.parquet.column.Dictionary realReader,
                                                                                    boolean isUtcTimestamp)
      • getDataColumnReaderByType

        public static ParquetDataColumnReader getDataColumnReaderByType(org.apache.parquet.schema.PrimitiveType parquetType,
                                                                        org.apache.parquet.column.values.ValuesReader realReader,
                                                                        boolean isUtcTimestamp)
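
The class description says the factory picks a reader based on the type stored in the file, so that callers can read values as the type they require. The sketch below illustrates that idea only. It is self-contained and does not use the real classes: TypedReaderSketch, PhysicalType, TypedReader, RawReader, readerFor and overInts are all hypothetical names invented for this example, and the real factory may handle more types and conversions than shown here.

```java
/** Self-contained sketch; none of these types are the real Flink or Parquet classes. */
public class TypedReaderSketch {

    /** Hypothetical stand-in for the physical type recorded in the file schema. */
    enum PhysicalType { INT32, INT64 }

    /** Hypothetical stand-in for the typed reader handed back to the caller. */
    interface TypedReader {
        long readLong();
    }

    /** Hypothetical stand-in for the low-level reader over the stored values. */
    interface RawReader {
        int readInt();
        long readRawLong();
    }

    /**
     * Factory method: chooses a reader from the stored type, so the caller can
     * always ask for a long even if the file was written with 32-bit integers.
     */
    static TypedReader readerFor(PhysicalType storedType, RawReader raw) {
        switch (storedType) {
            case INT32:
                // Stored as int, requested as long: widen on every read.
                return () -> (long) raw.readInt();
            case INT64:
                // Stored type already matches the requested type.
                return raw::readRawLong;
            default:
                throw new IllegalArgumentException("Unsupported type: " + storedType);
        }
    }

    /** Builds a raw reader over in-memory ints, standing in for file data. */
    static RawReader overInts(int... values) {
        return new RawReader() {
            private int pos = 0;

            @Override
            public int readInt() {
                return values[pos++];
            }

            @Override
            public long readRawLong() {
                throw new UnsupportedOperationException("values are stored as INT32");
            }
        };
    }

    public static void main(String[] args) {
        // The "file" holds 32-bit integers, but the caller wants longs.
        TypedReader reader = readerFor(PhysicalType.INT32, overInts(7, 42));
        System.out.println(reader.readLong()); // prints 7
        System.out.println(reader.readLong()); // prints 42
    }
}
```

The caller only ever sees TypedReader, so the decision about how to convert the stored values is made once, when the reader is created, rather than on every read.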