Reputation: 349
We are using the "legacy" (non-PipelineAPI) version of the mapreduce library: http://code.google.com/p/appengine-mapreduce/
The issue is that we can only ever get one shard processing, even for kinds that have >150,000 entities. We have tried different shard_count configurations, e.g, 4, 16, 128, but always only one shard processing entire dataset, which is very slow.
I feel like I've missed a step (e.g., creating an index or something). We just this legacy library successfully on other applications and it runs with multiple shards as expected.
Crossing my fingers that someone knows an offhand answer.
Thanks, j
Upvotes: 1
Views: 101
Reputation: 349
Found it!
This issue occurs in Python and only when you override the default kind name for your model.
I have put a patch for this on the mapreduce issue tracker: http://code.google.com/p/appengine-mapreduce/issues/detail?id=154
Upvotes: 2