docs: update documentation to account for parquet source #190
prakharmathur82 merged 12 commits into raystack:main from Meghajit:feat/dstrm#1009-update-dagger-documentation
docs/docs/concepts/architecture.md
Outdated
@prakharmathur82 But the image is getting rendered on the page. Do you mean you are not able to open the image via the link?
Yeah, images don't get rendered on the markdown page for some reason. It was happening earlier also. You can check the master branch: https://github.com/odpf/dagger/blob/main/docs/docs/concepts/architecture.md
Updated the relative paths
Fixed via commit 2de8360
docs/docs/concepts/lifecycle.md
Outdated
Image is getting rendered though
Updated the relative paths
Fixed via commit 2de8360
docs/docs/intro.md
Outdated
Image is getting rendered though
Updated the relative paths
Fixed via commit 2de8360
why is source connected to proto handler?
why call it proto handler?
again check for pre processor workflow
same for name of proto handler towards sink
why is source connected to proto handler?
Hmm. I guess the source should only be connected to the deserializer. The deserializer and proto handler work together to parse raw data (Parquet records, Kafka records) into a Row. Will fix this.
why call it proto handler?
Yes, I should have called it Type Handler. It was a copy-paste mistake from the earlier diagram. I will edit the name.
again check for pre processor workflow
Will do this.
same for name of proto handler towards sink
Will change it to Type Handler
Updated the diagrams
Fixed via commit 7242b10
README.md
Outdated
how about real-time streaming and historical data?
@prakharmathur82 We are able to process both real-time and historical batched data (Parquet files) as a stream. Hence, I put the "streaming" keyword after. Does the below look OK?
Dagger or Data Aggregator is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of streaming data, both real time and historical.
We can just say "stateful processing of data"
we can say stateful processing of data
Cool 👍
will do the change
docs/docs/concepts/architecture.md
Outdated
Commented above for the same
docs/docs/guides/create_dagger.md
Outdated
docs/docs/guides/overview.md
Outdated
@Meghajit can you cross check if docs build is passing by running
@ravisuhag Yes, it passed


Closes #189