Skip to content
Advertisement

Dataflow: string to pubsub message

I’m trying to make unit testing in Dataflow.

For that test, at the begging, I will start with a simple hardcoded string.

The problem is that I would need to transform that string to a pubsub message. I got the following code to do that:

JavaScript

But I get the following error:

JavaScript

How I should create the pubsub message from the string?

Advertisement

Answer

In the Beam Programming Guide under Serializability requirements for user ParDos, it mentions this:

Take care when declaring your function object inline by using an anonymous inner class instance. In a non-static context, your inner class instance will implicitly contain a pointer to the enclosing class and that class’ state. That enclosing class will also be serialized, and thus the same considerations that apply to the function object itself also apply to this outer class.

What’s happening is that your anonymous DoFn implicitly contains a pointer to the class you’re constructing the pipeline in, which is leading to this serialization failure. You can avoid this by making your DoFn a named subclass instead of anonymous:

JavaScript
Advertisement