-
Notifications
You must be signed in to change notification settings - Fork 22
Make column-based DataFrame ? #20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Any news/progress about this maybe? |
I had started working on a more traditional (i.e. column based) data frame library here. I didn't had much time recently to work on that, so it is still very far away from being usable, but I have plans to pick this up again soon(ish). |
Sorry to bring this up again (one whole year after the original post :)), but is there any progress with this? It seems that |
@narimiran This is also still very high on my wishlist, but I'm just not sure when I will find the time. With kadro I'm still experimenting how I can get dynamically typed columns right (and probably use Arraymancer under the hood). Overall NimData and kadro address different use cases: NimData is iterator based with support for statically typed schemas, kadro is in-memory with dynamically typed schemas. |
What do you think about creating a DataFrame type that's more column-based (list of Column types) rather than row-based (seq[T] of tuples based on schema)?
Taking some of the example operations from
example_01.nim
:This (and other numeric operations on an individual column) should be faster because we could return a shallowCopy view of the DF's column rather than creating a new MappedDataFrame (copying the needed data) and then running the summary function on the new dataframe.
I'd like to try and do this as part of Hacktoberfest.
The text was updated successfully, but these errors were encountered: