Skip to content
Advertisement

AvroParquetOutputFormat – Unable to Write Arrays with Null Elements

I’m using v1.11.1 of the parquet-mr library as part of a Java application that takes Avro records and writes them into Parquet files using the AvroParquetOutputFormat. There are Avro records with array type fields that will have null elements, e.g.

JavaScript

Here’s an example Avro schema:

JavaScript

I’m trying to write the following record:

JavaScript

I thought I could use the 3-level list writer to support this, however, it results in the following exception:

JavaScript

Is this kind of record supported? I have also tried the parquet.avro.add-list-element-records option set to false as well, with no luck.

EDIT: I have created a Parquet JIRA for tracking this issue.

Advertisement

Answer

For those interested – this required a patch, and it was merged to master here. Corresponding JIRA.

User contributions licensed under: CC BY-SA
8 People found this is helpful
Advertisement