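# Placeholder class for a data source that collects data in chunks from HDFS.
class ParquetChunkDataSource:
    def __init__(self, *args, **kwargs):
        # Raise on instantiation rather than in the class body, so the module stays importable.
        raise NotImplementedError("Please implement your own dataloading logic")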