cuDF (pronounced "KOO-dee-eff") is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data.
Test:
import cudf

tips_df = cudf.read_csv("https://github.com/plotly/datasets/raw/master/tips.csv")
tips_df["tip_percentage"] = tips_df["tip"] / tips_df["total_bill"] * 100

# display average tip by dining party size
print(tips_df.groupby("size").tip_percentage.mean())
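To see what the test computes without a GPU or network access, here is a minimal sketch of the same calculation in plain pandas. The rows are made up for illustration and are not taken from tips.csv; only the column names follow the test above.

```python
import pandas as pd

# Hypothetical rows for illustration only; not taken from tips.csv.
tips_df = pd.DataFrame(
    {
        "total_bill": [10.0, 20.0, 40.0, 50.0],
        "tip": [1.0, 4.0, 4.0, 10.0],
        "size": [2, 2, 4, 4],
    }
)

# Tip as a percentage of the bill, row by row.
tips_df["tip_percentage"] = tips_df["tip"] / tips_df["total_bill"] * 100

# Average tip percentage per dining party size.
avg_tip_by_size = tips_df.groupby("size")["tip_percentage"].mean()
print(avg_tip_by_size)
```

If cuDF is installed, swapping `pandas` for `cudf` in the import should leave the rest of this sketch unchanged, since it only uses the calls already shown in the test.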
Hi! I came across this issue due to the cuDF reference. I work on cuDF and other RAPIDS projects at NVIDIA.
In addition to being a GPU library, cuDF can provide zero-code-change GPU acceleration for pandas and (as of yesterday) Polars.
%load_ext cudf.pandas  # or via command line for Python scripts
import pandas as pd

df = pd.read_parquet(filepath)
(df[["Registration State", "Violation Description"]]
.value_counts()
.groupby("Registration State")
.head()
.sort_index()
)
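For anyone who wants to see what that chain returns before trying it on a GPU, here is a small sketch in plain pandas. The sample rows are hypothetical; only the two column names come from the snippet above.

```python
import pandas as pd

# Hypothetical sample rows; the column names match the snippet above.
df = pd.DataFrame(
    {
        "Registration State": ["NY", "NY", "NY", "NJ", "NJ"],
        "Violation Description": ["A", "A", "B", "C", "C"],
    }
)

top_violations = (
    df[["Registration State", "Violation Description"]]
    .value_counts()                 # count each (state, violation) pair, most frequent first
    .groupby("Registration State")  # regroup those counts by state
    .head()                         # keep up to five most frequent pairs per state
    .sort_index()                   # order by state, then violation
)
print(top_violations)
```

For a plain Python script, the command-line form mentioned in the comment is `python -m cudf.pandas script.py`, which runs the same unmodified pandas code with cuDF acceleration enabled.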
Would love to see these capabilities available for LEAN users. Happy to try to help answer any questions that might come up if you or anyone else explores this.
AlexCatarino changed the title from "Library Request: cuDF" to "Library Request: cuDF + RAPIDS" on Sep 25, 2024
EDIT: We need to install RAPIDS too.
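A quick way to check whether cuDF is present in a given environment before running the test is sketched below. It uses only the standard library; the helper name `rapids_available` is made up for this example, and it only checks that the module can be found, not that a compatible GPU or driver is available.

```python
import importlib.util


def rapids_available(module_name: str = "cudf") -> bool:
    """Return True if the named top-level module can be found (hypothetical helper)."""
    # find_spec locates the module without importing it, so no GPU is touched here.
    return importlib.util.find_spec(module_name) is not None


if rapids_available():
    print("cuDF is installed")
else:
    print("cuDF not found; RAPIDS needs to be installed first")
```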