Support partitionBy with Rikai dataframe writer in Spark #635

eddyxu · 2022-04-19T21:08:14Z

df.write.rikai() supports partitionBy(col, ...)

eddyxu · 2022-04-19T21:10:31Z

Closes #634

changhiskhan · 2022-04-19T21:16:51Z

src/main/scala/ai/eto/rikai/RikaiOptions.scala

+      case Some(cols) =>
+        Some(
+          cols
+            .substring(1, cols.length - 1)


why do we need this subtring here?

The value in parameters: Map[string, string] is something like

Map(__partition_columns -> ["label"], path -> /var/folders/m1/vyb5yj4n5cb17gv371g09k280000gn/T/rikai6565224676268032985/dataset)

the string value includes double quote

use partition with Rikai format

4c67597

changhiskhan reviewed Apr 19, 2022

View reviewed changes

changhiskhan approved these changes Apr 19, 2022

View reviewed changes

eddyxu merged commit 70e1ccb into eto-ai:main Apr 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support partitionBy with Rikai dataframe writer in Spark #635

Support partitionBy with Rikai dataframe writer in Spark #635

eddyxu commented Apr 19, 2022 •

edited

Loading

eddyxu commented Apr 19, 2022

changhiskhan Apr 19, 2022

eddyxu Apr 19, 2022

Support partitionBy with Rikai dataframe writer in Spark #635

Support partitionBy with Rikai dataframe writer in Spark #635

Conversation

eddyxu commented Apr 19, 2022 • edited Loading

eddyxu commented Apr 19, 2022

changhiskhan Apr 19, 2022

Choose a reason for hiding this comment

eddyxu Apr 19, 2022

Choose a reason for hiding this comment

eddyxu commented Apr 19, 2022 •

edited

Loading