Remove table names from column names for `df()` call #1256

pchunduri6 · 2023-10-05T17:36:25Z

This PR removes table names from the DataFrame column names returned by `df()`. Users can then save a result with `to_csv()` and load that CSV file back into EvaDB later, which is useful for long-running or expensive queries.

Example:

select_query = cursor.query(
    f"SELECT * FROM {repo_name}_StargazerList;"
).df()

select_query.to_csv("stargazers_list.csv", index=False)

# Later
cursor.query(
    f"""
    CREATE TABLE IF NOT EXISTS {repo_name}_StargazerList(
    github_username TEXT(1000));
    """
).df()

# The f prefix is required so that {repo_name} is interpolated.
cursor.query(f"LOAD CSV 'stargazers_list.csv' INTO {repo_name}_StargazerList;").df()
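
To make the effect on the CSV header concrete, here is a minimal pandas-only sketch. It assumes that qualified column names have the form `table.column`; the helper `strip_table_prefix` and the sample data are hypothetical and are not taken from the EvaDB code base.

```python
import io

import pandas as pd


def strip_table_prefix(name: str) -> str:
    """Hypothetical helper: keep only the part after the last dot."""
    return name.rsplit(".", 1)[-1]


# Assumed shape of a result before this change: table-qualified column names.
qualified = pd.DataFrame({"repo_stargazerlist.github_username": ["alice", "bob"]})

# Assumed shape after this change: bare column names.
bare = qualified.rename(columns=strip_table_prefix)

# Write to an in-memory CSV and read back the header line.
buffer = io.StringIO()
bare.to_csv(buffer, index=False)
header = buffer.getvalue().splitlines()[0]
print(header)
```

With bare names, the CSV header matches the column declared in the `CREATE TABLE` statement, which is what lets the later `LOAD CSV` line up with the table.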

Do we need the table names for any use cases? For example, to disambiguate duplicate column names produced by two different functions, such as object_detector_1.labels and object_detector_2.labels?
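
The duplicate-name concern can be reproduced with plain pandas. This is only a sketch under the assumption that prefixes are dropped by splitting on the last dot; the helper and the sample values are hypothetical.

```python
import pandas as pd


def strip_table_prefix(name: str) -> str:
    """Hypothetical helper: keep only the part after the last dot."""
    return name.rsplit(".", 1)[-1]


# Two function outputs that share the column name "labels".
detections = pd.DataFrame(
    {
        "object_detector_1.labels": ["car"],
        "object_detector_2.labels": ["person"],
    }
)

# After stripping, both columns are named "labels".
stripped = detections.rename(columns=strip_table_prefix)

# pandas permits duplicate column labels, so the clash must be checked explicitly.
has_collision = bool(stripped.columns.duplicated().any())
print(list(stripped.columns), has_collision)
```

If such collisions matter, one option would be to keep the qualified name only for the columns that clash; whether that is desirable is exactly the open question above.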

xzdandy · 2023-10-06T18:10:22Z

Please do not merge yet. I am making minor changes to the PR.

pchunduri6 added 2 commits October 5, 2023 13:19

drop table names from column names in output

1e327e4

Merge remote-tracking branch 'origin/staging' into csv-column-names

f2f6805

pchunduri6 requested review from xzdandy and gaurav274 October 5, 2023 17:36

pchunduri6 linked an issue Oct 5, 2023 that may be closed by this pull request

select_query.to_csv() adds table name to the csv file #1216

Closed

xzdandy approved these changes Oct 6, 2023

xzdandy assigned pchunduri6 Oct 6, 2023

xzdandy added the User Experience label Oct 6, 2023

xzdandy added this to the v0.3.8 milestone Oct 6, 2023

Andy Xu and others added 7 commits October 6, 2023 23:45

Merge branch 'staging' into csv-column-names

8fe769d

Checkpoint to switch to ada01

e446116

Fix existing test

79e5544

Add comments and new test

b2b96cd

Fix documentation and comments

2ba8f97

Fix doc

874525a

Fix linter

69d21b0

xzdandy merged commit d6cb3a5 into staging Oct 7, 2023
7 checks passed

xzdandy deleted the csv-column-names branch October 7, 2023 06:14

xzdandy mentioned this pull request Oct 7, 2023

Update notebook's column name after next release #1265

Closed

2 tasks
