alteryx / featuretools_sql Goto Github PK
View Code? Open in Web Editor NEWAutomated creation of EntitySets from relational data stored in SQL databases
License: BSD 3-Clause "New" or "Revised" License
Automated creation of EntitySets from relational data stored in SQL databases
License: BSD 3-Clause "New" or "Revised" License
I should be able to use the select_only
parameter when calling get_entityset
, not just when calling populate_dataframes
Suppose user has 100 tables, and wants to include all but n
. It may be useful to have a do_not_select
parameter instead of having to enumerate the 100-n
tables they do want in the select_only
argument.
Write blog post introducting featuretools-sql
library
We should support converting the SQL tables into Dask DataFrames; we can use the functions detailed below:
filter
parameter that the user can use to specify which tables they want to import from. Update the test cases to add tests that cover this case.featuretools
sample data, and allow user to run tests on thatcomplete =
tensorflow >= 1.14.0; sys_platform!="darwin" or platform_machine!='arm64'
tensorflow-metal >= 0.4.0; sys_platform=="darwin" and platform_machine=='arm64'
tensorflow-macos >= 2.8.0; sys_platform=="darwin" and platform_machine=='arm64'
tensorflow_hub >= 0.4.0
After releasing on conda-forge
(#6) , we should update Featuretools's install.md
to reflect the availability of featuretools_sql
on conda-forge
We may be able to achieve significant performance boosts by replacing read_sql_query
with DuckDB's API. For more info, refer here: https://duckdb.org/2021/05/14/sql-on-pandas.html
-Add relationship information for retail dataset to Testing Postgres Database (Retail Demo for all)
-Test that number of output dataframes match input dataframes
-Test types match (this might fail because WW types aren't the same)
-Pull out EntitySet setup and tear down into a reproduceable mechanism that we can reuse in other tests
-Ensure relationship information matches
relationships
data structure will fail. What is the best behavior in this scenario? For now, we can throw an informational warning and leave it to the user to set a primary key.Pandas's IO library is throwing warnings when we use it with a non-SQLAlchemy connector object. Furthermore, there seems to be issues with snowflake.connector objects handling particular data types.
We should decide how we want to resolve this issue, either by working around it or switching to SQLAlchemy.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.