Improved cursor comparison - only showing different lines #527

pesse · Nov 29, 2017

At least partly fixes #454

jgebal · Nov 29, 2017

source/expectations/data_values/ut_cursor_data_diff.sql

@@ -0,0 +1,4 @@
+create global temporary table ut_cursor_data_diff(


comments in tables are allowed but no empty lines.

Pazus

Except a small refactor suggestion everything is ok.

Pazus · Nov 29, 2017

source/expectations/data_values/ut_data_value_refcursor.tpb

-       where nvl(dbms_lob.compare(xmlserialize( content exp.row_data no indent), xmlserialize( content act.row_data no indent)),1) != 0
-         and rownum <= 1;
+       where nvl(dbms_lob.compare(xmlserialize( content exp.row_data no indent), xmlserialize( content act.row_data no indent)),1) != 0;
+      select count(1) into l_result from ut_cursor_data_comp where rownum <= 1;


You can use SQL%ROWCOUNT instead of selecting rows number after select.

jgebal · Nov 29, 2017

Can you add tests, so are covered for regression?

Succeeds when cursors are same.
Succeeds bot when cursors are empty.
Fails and returns rows from actual that are not in expected
Fails and returns rows from expected that are not in actual

any others?

jgebal · Nov 29, 2017

My concern is that there is no limit on the data returned.
If the cursor will contain 100 cols and 10M rows, it will cause some serious issues.

Pazus · Nov 29, 2017

Good point, Jacek. We still need a limit...

jgebal · Nov 29, 2017

source/expectations/data_values/ut_data_value_refcursor.tpb

        from (select case when l_xpath is not null then deletexml( ucd.row_data, l_xpath ) else ucd.row_data end as row_data,
                     ucd.row_no
                from ut_cursor_data ucd where ucd.cursor_data_guid = self.data_value) exp
        full outer join (select case when l_xpath is not null then deletexml( ucd.row_data, l_xpath ) else ucd.row_data end as row_data,
                                ucd.row_no
                           from ut_cursor_data ucd where ucd.cursor_data_guid = l_other.data_value) act
         on (exp.row_no = act.row_no)
-       where nvl(dbms_lob.compare(xmlserialize( content exp.row_data no indent), xmlserialize( content act.row_data no indent)),1) != 0
-         and rownum <= 1;
+       where nvl(dbms_lob.compare(xmlserialize( content exp.row_data no indent), xmlserialize( content act.row_data no indent)),1) != 0;


I would add limit here (in the insert statement), so that we only process first 50 - 100 differences.
Why 50 - 100?
10 seems not enough, but anything more than 50 - 100 doesn't bring much value anyway.
If you have so may differences, you're in serious trouble anyway.

If first 100 rows doesn't help you identify what the problem is - your test is not distilled properly and you're testing all in one go probably.

Second thought on this idea.
Compare all rows.
Remember toe count of:

rows in A

rows in B

differences
Report first x differences (having x 50 .. 100)
Report also number of rows in A, number of rows in B and total number of differences.

That will give much more information about failed expectations,

Had something similar in mind myself - would definitely help to output exact number of mismatches because we got them anyways...

mathewbutler · Nov 29, 2017

I said a long time ago that I’d look at an alternative implementation. The best I’ve done is to put together a simple SQL implementation of a compare. The below does two full table scans to identify differences. It’s an asktom special - forget the name of the person on the three that worked through the problem to come up with this.

select c1, c2, c3,
2 count(src1) CNT1,
3 count(src2) CNT2
4 from
5 ( select a.,
6 1 src1,
7 to_number(null) src2
8 from a
9 union all
10 select b.,
11 to_number(null) src1,
12 2 src2
13 from b
14 )
15 group by c1,c2,c3
16 having count(src1) <> count(src2)

My idea was to use this and dynamically produce the query. With testing to see if special handling needed for all data types.

Laptop times for comparing two 1M tables were .8 seconds and low resource usage.

Leaving this here, as above forms the basis of a POC. We said that we wouldn’t change the implementation unless there were clear benefits or new requirements.

Cheers.

jgebal · Dec 1, 2017

@mathewbutler Agreed.
We can always do this for simple datatypes, on top of what we do.

That is, if not clob, blob, xmltype etc - use plain SQL compare, else do to XML and compare CLOBs.

For now, we definitely want to improve reporting side, so that we show rows that differ instead of first 'x' rows from data-set.
I think it's a nice step forward.

Still having to download and install ojdbc.jar

...because of owner issues

pesse · Dec 1, 2017

Seems the changes broke the exclude-possibility of columns. Might take me some time to get it sorted out.

# Conflicts: # development/refresh_sources.sh

Fixed failing tests fro 11g (record types in SQL) as well as old_test. Refactored the comparison a little bit.

…success. Added test for empty vs null cursor.

Added some additional tests.

…rsor_comparison

Improved cursor comparison - only showing different lines

df98ebc

pesse added the in progress label Nov 29, 2017

jgebal reviewed Nov 29, 2017

View reviewed changes

Pazus reviewed Nov 29, 2017

View reviewed changes

Update ut_cursor_data_diff.sql

ca63cda

jgebal reviewed Nov 29, 2017

View reviewed changes

pesse added 6 commits December 1, 2017 10:53

Updated development scripts

f3b6fa4

Still having to download and install ojdbc.jar

Updated contribution docs with ojdbc.jar notice

a340746

Simple regression tests

4d33463

Refactoring refcursor dataValue to dynamic sql

fcf6117

...because of owner issues

Easier to understand return value

bab7da8

Improved feedback on refcursor comparison

b7ac3a4

jgebal and others added 10 commits December 6, 2017 21:11

Merge branch 'develop' into feature/improve_cursor_comparison

a0c2f59

Merge branch 'develop' into feature/improve_cursor_comparison

9febac9

# Conflicts: # development/refresh_sources.sh

Fixed return code of data-comparison.

72261e1

Fixed failing tests fro 11g (record types in SQL) as well as old_test. Refactored the comparison a little bit.

Fixed a bug, where empty cursor compared with null cursor was giving …

91ba637

…success. Added test for empty vs null cursor.

Migrates some cursor tests to new style.

2f3479b

Merge branch 'develop' into feature/improve_cursor_comparison

fb44153

Migrated few more tests for cursor comparison.

ec5784b

Added some additional tests.

Merge remote-tracking branch 'origin/develop' into feature/improve_cu…

01ec495

…rsor_comparison

Added tests for cursor.

8955f5a

Merge branch 'develop' into feature/improve_cursor_comparison

33a9b90

jgebal merged commit 2137439 into develop Jan 5, 2018

jgebal deleted the feature/improve_cursor_comparison branch January 5, 2018 19:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improved cursor comparison - only showing different lines #527

Improved cursor comparison - only showing different lines #527

Uh oh!

pesse commented Nov 29, 2017

Uh oh!

jgebal Nov 29, 2017

Uh oh!

Pazus left a comment

Uh oh!

Pazus Nov 29, 2017

Uh oh!

jgebal commented Nov 29, 2017 •

edited

Loading

Uh oh!

jgebal commented Nov 29, 2017

Uh oh!

Pazus commented Nov 29, 2017

Uh oh!

jgebal Nov 29, 2017

Uh oh!

jgebal Dec 1, 2017 •

edited

Loading

Uh oh!

pesse Dec 1, 2017

Uh oh!

mathewbutler commented Nov 29, 2017

Uh oh!

jgebal commented Dec 1, 2017

Uh oh!

pesse commented Dec 1, 2017

Uh oh!

Uh oh!

		@@ -0,0 +1,4 @@
		create global temporary table ut_cursor_data_diff(

Search code, repositories, users, issues, pull requests...

Improved cursor comparison - only showing different lines #527

Improved cursor comparison - only showing different lines #527

Uh oh!

Conversation

pesse commented Nov 29, 2017

Uh oh!

jgebal Nov 29, 2017

Choose a reason for hiding this comment

Uh oh!

Pazus left a comment

Choose a reason for hiding this comment

Uh oh!

Pazus Nov 29, 2017

Choose a reason for hiding this comment

Uh oh!

jgebal commented Nov 29, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jgebal commented Nov 29, 2017

Uh oh!

Pazus commented Nov 29, 2017

Uh oh!

jgebal Nov 29, 2017

Choose a reason for hiding this comment

Uh oh!

jgebal Dec 1, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pesse Dec 1, 2017

Choose a reason for hiding this comment

Uh oh!

mathewbutler commented Nov 29, 2017

Uh oh!

jgebal commented Dec 1, 2017

Uh oh!

pesse commented Dec 1, 2017

Uh oh!

Uh oh!

jgebal commented Nov 29, 2017 •

edited

Loading

jgebal Dec 1, 2017 •

edited

Loading