Reputation: 59
I am trying to run a sliding window calculation on a set of tuples stored in a CSV file. Each row has a date associated with it. After reading the CSV file using TextIO.Read, I apply a ParDo transformation to change the timestamp of each element in the PCollection.
//Reading and timestamping the stock prices
PCollection<KV<Integer, StockPricePoint>> stockPrices = pipeline
    .apply(TextIO.Read.from("./inputFiles/2004.csv"))
    .apply(ParDo.of(new DoFn<String, KV<Integer, StockPricePoint>>() {
      @Override
      public void processElement(ProcessContext c) throws Exception {
        String[] fields = c.element().split(",");
        try {
          StockPricePoint stockPoint = new StockPricePoint();
          stockPoint.setId(fields[0]);
          // "yyyyMMdd" -- capital MM is month; lowercase mm would parse minutes
          SimpleDateFormat sdf = new SimpleDateFormat("yyyyMMdd");
          stockPoint.setDate(sdf.parse(fields[1].trim()));
          stockPoint.setSymbol(fields[2]);
          stockPoint.setPrice(Double.parseDouble(fields[5].trim()));
          stockPoint.setCap(Double.parseDouble(fields[6].trim()));
          // Use the parsed date as the element's event-time timestamp
          Instant instant = new Instant(stockPoint.getDate().getTime());
          // symbolEncoder is defined in the enclosing scope
          c.outputWithTimestamp(KV.of(
              symbolEncoder.getSymbolIndex(stockPoint.getSymbol()), stockPoint),
              instant);
        } catch (Exception ex) {
          // TODO: accumulate errors
          ex.printStackTrace();
        }
      }
    }));
Then I apply a sliding window transformation as follows:
//Creating the sliding windows
PCollection<KV<Integer, StockPricePoint>> slidingWindowStockPrices = stockPrices
    .apply(Window.<KV<Integer, StockPricePoint>>into(
        SlidingWindows.of(Duration.standardDays(30))
            .every(Duration.standardDays(5))));
After that, when I invoke a GroupByKey transformation as follows, I get a "GlobalWindow cannot be cast to IntervalWindow" exception. What could possibly be going wrong here?
slidingWindowStockPrices.apply(GroupByKey.create());
The full stack trace can be found here: http://pastebin.com/QUStvrfB
Upvotes: 0
Views: 587
Reputation: 31
I've tried to reproduce the behaviour in a unit test using your code snippets, but I couldn't reproduce the issue. Please see https://issues.apache.org/jira/browse/BEAM-112. I did, however, note that you were working with an early version of the Spark runner from before it was integrated into Apache Beam (or even Cloudera's spark-dataflow), so it might be worth a shot to update your Spark runner.
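For reference, here is a minimal sketch of the kind of reproduction test I mean, assuming the pre-Beam Dataflow SDK 1.x APIs (this is a sketch, not the actual BEAM-112 test; the class name and the sample values are placeholders):

import com.google.cloud.dataflow.sdk.testing.TestPipeline;
import com.google.cloud.dataflow.sdk.transforms.Create;
import com.google.cloud.dataflow.sdk.transforms.GroupByKey;
import com.google.cloud.dataflow.sdk.transforms.windowing.SlidingWindows;
import com.google.cloud.dataflow.sdk.transforms.windowing.Window;
import com.google.cloud.dataflow.sdk.values.KV;
import com.google.cloud.dataflow.sdk.values.PCollection;
import com.google.cloud.dataflow.sdk.values.TimestampedValue;
import org.joda.time.Duration;
import org.joda.time.Instant;

public class SlidingWindowGroupByKeyRepro {
  public static void main(String[] args) {
    TestPipeline pipeline = TestPipeline.create();

    // Pre-timestamped elements stand in for the CSV read + outputWithTimestamp step
    PCollection<KV<Integer, Double>> prices = pipeline.apply(Create.timestamped(
        TimestampedValue.of(KV.of(1, 10.0), new Instant(0L)),
        TimestampedValue.of(KV.of(1, 12.5),
            new Instant(Duration.standardDays(3).getMillis()))));

    // Same windowing + GroupByKey as in the question; if the runner mishandles
    // sliding windows, the cast error should surface when the pipeline runs
    prices
        .apply(Window.<KV<Integer, Double>>into(
            SlidingWindows.of(Duration.standardDays(30))
                .every(Duration.standardDays(5))))
        .apply(GroupByKey.<Integer, Double>create());

    pipeline.run();
  }
}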
Please let me know if I've missed anything here.
Upvotes: 0
Reputation: 6130
This does not look like a problem with the Google Cloud Dataflow service. You can also try the DirectPipelineRunner for local testing.
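For example, here is a minimal sketch of running the same pipeline locally, assuming the pre-Beam Dataflow SDK 1.x options API (the class name is just a placeholder):

import com.google.cloud.dataflow.sdk.Pipeline;
import com.google.cloud.dataflow.sdk.options.PipelineOptions;
import com.google.cloud.dataflow.sdk.options.PipelineOptionsFactory;
import com.google.cloud.dataflow.sdk.runners.DirectPipelineRunner;

public class LocalRunnerCheck {
  public static void main(String[] args) {
    PipelineOptions options = PipelineOptionsFactory.create();
    // Run in-process instead of on Spark to isolate runner-specific behaviour
    options.setRunner(DirectPipelineRunner.class);
    Pipeline pipeline = Pipeline.create(options);
    // ...build the same read/timestamp/window/GroupByKey pipeline here...
    pipeline.run();
  }
}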
It looks like a problem with the Spark runner, which may not yet implement the full semantics necessary for triggering in the GlobalWindow. The Spark runner is now maintained by the Apache Beam (incubating) project; I have filed a ticket on the project's Jira to track this issue.
Upvotes: 1