Bigtable: Implement query sharding by generalizing ReadRows resume request builder. by igorbernstein2 · Pull Request #3103 · googleapis/google-cloud-java

igorbernstein2 · 2018-03-29T03:07:58Z

This extends the work done in #2986 to allow map reduce style frameworks like beam to split queries into shards and execute them in parallel. The mechanism for chopping off part of query in a resume request is very similar to splitting a query into multiple shards. The main difference is how many splits are used.

The common functionality is extracted to an internal RowSetUtil class that does all of the heavy lifting. The class is used both byReadRowsResumptionStrategy for computing the resume request and the newly introduced Query#shard method.

Also expose the ability to get a Query's bounding range. The combination of Query#shard & Query#getBound is needed to implement a Beam source

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bigtable: Implement query sharding by generalizing ReadRows resume request builder.#3103

Bigtable: Implement query sharding by generalizing ReadRows resume request builder.#3103
garrettjonesgoogle merged 10 commits intogoogleapis:masterfrom
igorbernstein2:query-sharding

igorbernstein2 commented Mar 29, 2018 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

igorbernstein2 commented Mar 29, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

igorbernstein2 commented Mar 29, 2018 •

edited

Loading