[PECO-1532] Ignore the excess records in query results by kravets-levko · Pull Request #239 · databricks/databricks-sql-nodejs

kravets-levko · Mar 21, 2024

Note for reviewers: this PR includes some refactoring needed to implement the fix, so it is probably easier to review commit by commit.

When the client library executes a query and requests Arrow-based or CloudFetch results, the server returns records as Arrow batches. Batch size may vary; the server decides it based on the record count, record size, and so on. Usually all batches have the same size, with one exception: the last batch, which usually contains fewer records. There are two possibilities:

1. The server may make the last batch smaller, so that it contains only the remaining records.
2. The server may fetch more records than needed, so that the last batch has the same size as the others. In that case it also sets a rowCount field, which defines how many "valid" records are in the batch. The client should use only those records and discard the rest.

(I guess that different workspaces may be configured differently and behave as described in either scenario 1 or scenario 2.)

The Node.js connector doesn't use the value of rowCount and therefore returns those extra records to the user. This behavior is wrong, and this PR fixes it.
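The trimming described above can be sketched roughly as follows, assuming each batch has already been decoded into a plain array of records. The `DecodedBatch` interface and the `trimBatch` and `collectValidRows` helpers are hypothetical names used only for illustration and are not the connector's actual API; only the `rowCount` field comes from the description.

```typescript
// Hypothetical shape of one decoded batch (illustrative, not the connector's type):
// the records the server sent, plus how many of them it marked as valid.
interface DecodedBatch<T> {
  records: T[];
  rowCount: number;
}

// Keep only the first `rowCount` records; anything after them is padding.
function trimBatch<T>(batch: DecodedBatch<T>): T[] {
  // Clamp defensively so an unexpected rowCount cannot produce a
  // negative or oversized slice.
  const valid = Math.max(0, Math.min(batch.rowCount, batch.records.length));
  return batch.records.slice(0, valid);
}

// Concatenate the valid rows of all batches. A batch with no valid rows
// contributes nothing to the result.
function collectValidRows<T>(batches: DecodedBatch<T>[]): T[] {
  const result: T[] = [];
  for (const batch of batches) {
    for (const row of trimBatch(batch)) {
      result.push(row);
    }
  }
  return result;
}

const exampleBatches: DecodedBatch<number>[] = [
  { records: [1, 2, 3], rowCount: 3 }, // full batch
  { records: [4, 5, 6], rowCount: 1 }, // scenario 2: padded last batch
];

const exampleRows = collectValidRows(exampleBatches);
console.log(exampleRows); // contains 1, 2, 3, 4; the padding (5 and 6) is dropped
```

In scenario 1 the last batch has `rowCount` equal to its length, so the same code returns every record unchanged.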

benc-db

Looks reasonable to me, but I'm neither a node nor thrift semantics expert.

[PECO-1532] Arrow and CloudFetch result handlers: return row count wi…

85343e9

…th raw batch data Signed-off-by: Levko Kravets <levko.ne@gmail.com>

kravets-levko temporarily deployed to azure-prod March 21, 2024 12:24 — with GitHub Actions Inactive

Update tests

f88ecc6

Signed-off-by: Levko Kravets <levko.ne@gmail.com>

kravets-levko temporarily deployed to azure-prod March 21, 2024 15:43 — with GitHub Actions Inactive

Refactor ArrowResultConverter - cleanup and make it skip empty batches

374af38

Signed-off-by: Levko Kravets <levko.ne@gmail.com>

kravets-levko temporarily deployed to azure-prod March 21, 2024 19:43 — with GitHub Actions Inactive

databricks deleted a comment from codecov-commenter Mar 21, 2024

[PECO-1532] Ignore the excess records in arrow batches

560ffca

Signed-off-by: Levko Kravets <levko.ne@gmail.com>

kravets-levko temporarily deployed to azure-prod March 25, 2024 11:48 — with GitHub Actions Inactive

databricks deleted a comment from codecov-commenter Mar 25, 2024

Add tests

b8287a5

Signed-off-by: Levko Kravets <levko.ne@gmail.com>

kravets-levko temporarily deployed to azure-prod March 25, 2024 15:23 — with GitHub Actions Inactive

kravets-levko marked this pull request as ready for review March 25, 2024 15:29

kravets-levko requested review from andrefurlan-db, arikfr, benc-db, jackyhu-db, rcypher-databricks, superdupershant and yunbodeng-db as code owners March 25, 2024 15:29

benc-db approved these changes Mar 26, 2024

kravets-levko merged commit 1dc16ac into main Mar 27, 2024

kravets-levko deleted the PECO-1532-ignore-excess-records branch March 27, 2024 12:16

kravets-levko merged 5 commits into databricks/databricks-sql-nodejs:main from databricks/databricks-sql-nodejs:PECO-1532-ignore-excess-records
