[Fix](PaimonCatalog) fix the problem that paimon catalog can not access to OSS-HDFS #42585
Thank you for your contribution to Apache Doris. Since 2024-03-18, the Document has been moved to doris-website.
run buildall
TPC-H: Total hot run time: 41330 ms
TPC-DS: Total hot run time: 197620 ms
ClickBench: Total hot run time: 32.52 s
TPC-H: Total hot run time: 41660 ms
TPC-DS: Total hot run time: 198802 ms
ClickBench: Total hot run time: 32.47 s
LGTM
PR approved by at least one committer and no changes requested.
PR approved by anyone and no changes requested.
run buildall
TPC-H: Total hot run time: 41400 ms
TPC-DS: Total hot run time: 197343 ms
ClickBench: Total hot run time: 32.2 s
PR approved by at least one committer and no changes requested.
…ss to OSS-HDFS (#42585) Fix the problem that the Paimon catalog cannot access OSS-HDFS. There are 2 problems in the Paimon catalog: 1. Doris FE cannot list Paimon tables. This is because we pass these three properties, `fs.oss.endpoint` / `fs.oss.accessKeyId` / `fs.oss.accessKeySecret`, to the PaimonCatalog. When PaimonCatalog gets these three properties, it uses `OSSLoader` rather than `HadoopFileIOLoader`. 2. Doris BE does not use libhdfs to access OSS-HDFS. This is because the `tmpLocation` in `LocationPath` does not contain `oss-dls.aliyuncs`. We should use `endpoint` to judge whether the user wants to access OSS-HDFS. In addition, to access OSS-HDFS with PaimonCatalog, you should: 1. Download the Jindo SDK: https://github.com/aliyun/alibabacloud-jindodata/blob/latest/docs/user/zh/jindosdk/jindosdk_download.md 2. Copy `jindo-core.jar` and `jindo-sdk.jar` to the `${DORIS_HOME}/fe/lib` and `${DORIS_HOME}/be/lib/java_extensions/preload-extensions` directories.
Fix the problem that the Paimon catalog cannot access OSS-HDFS.
There are 2 problems in the Paimon catalog:
1. Doris FE cannot list Paimon tables.
   This is because we pass these three properties, `fs.oss.endpoint` / `fs.oss.accessKeyId` / `fs.oss.accessKeySecret`, to the PaimonCatalog. When PaimonCatalog gets these three properties, it uses `OSSLoader` rather than `HadoopFileIOLoader`.
2. Doris BE does not use libhdfs to access OSS-HDFS.
   This is because the `tmpLocation` in `LocationPath` does not contain `oss-dls.aliyuncs`. We should use `endpoint` to judge whether the user wants to access OSS-HDFS.

In addition, to access OSS-HDFS with PaimonCatalog, you should:
1. Download the Jindo SDK: https://github.com/aliyun/alibabacloud-jindodata/blob/latest/docs/user/zh/jindosdk/jindosdk_download.md
2. Copy `jindo-core.jar` and `jindo-sdk.jar` to the `${DORIS_HOME}/fe/lib` and `${DORIS_HOME}/be/lib/java_extensions/preload-extensions` directories.
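
For readers who want a concrete picture of the two fixes described above, here is a minimal Java sketch. It is not the code from this PR: the class `OssHdfsSketch`, both method names, and the example endpoint strings are hypothetical, and dropping the three `fs.oss.*` keys is only one plausible way to keep PaimonCatalog from choosing `OSSLoader`. The actual change may handle the properties differently.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

/** Illustrative sketch only; class and method names are hypothetical, not Doris code. */
public class OssHdfsSketch {
    // Marker substring named in the PR description.
    static final String OSS_HDFS_MARKER = "oss-dls.aliyuncs";

    // The three properties that, per the description, make PaimonCatalog use OSSLoader.
    static final Set<String> NATIVE_OSS_KEYS = Set.of(
            "fs.oss.endpoint", "fs.oss.accessKeyId", "fs.oss.accessKeySecret");

    /** Decide from the endpoint (not from the table location) whether OSS-HDFS is wanted. */
    static boolean isOssHdfsEndpoint(String endpoint) {
        return endpoint != null && endpoint.contains(OSS_HDFS_MARKER);
    }

    /**
     * Assumption: when the endpoint points at OSS-HDFS, the native OSS keys are
     * withheld from the catalog options so the Hadoop-based file IO is chosen.
     * Otherwise the properties are passed through unchanged.
     */
    static Map<String, String> catalogOptions(Map<String, String> props) {
        Map<String, String> out = new HashMap<>(props);
        if (isOssHdfsEndpoint(props.get("fs.oss.endpoint"))) {
            out.keySet().removeAll(NATIVE_OSS_KEYS);
        }
        return out;
    }

    public static void main(String[] args) {
        // Example values only; substitute your own endpoint and warehouse path.
        Map<String, String> props = new HashMap<>();
        props.put("fs.oss.endpoint", "cn-beijing.oss-dls.aliyuncs.com");
        props.put("fs.oss.accessKeyId", "example-id");
        props.put("fs.oss.accessKeySecret", "example-secret");
        props.put("warehouse", "oss://example-bucket/warehouse");

        System.out.println(isOssHdfsEndpoint(props.get("fs.oss.endpoint")));
        System.out.println(catalogOptions(props).keySet());
    }
}
```

The point of checking the endpoint is that a table location such as `oss://bucket/path` carries no hint of OSS-HDFS, whereas the configured endpoint does.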