Comments (4)
This is due to ambiguity with the single tuple version of predict added for preprocessing when there is a single raw feature argument. This used to work a bit by accident, as the smallint[] and broader feature float4[] were both flattened into a single array. I'll look at what we can do to disambiguate, it maybe as easy as adding explicit predict functions for each type of numeric feature array to provide exact matches and disambiguate, but I'd rather see if we can do something smarter, to disambiguate based on the preprocessing directives, in addition to the types.
from postgresml.
any progress on this? :) it makes using preprocessing feature rather unusable
from postgresml.
I'm worried about this ambiguity in the APIs, since you mention it makes preprocessing unusable. To clarify, predict
is overloaded, in non equivalent ways for preprocessing.
- Passing an
ARRAY
(now of any of the common numeric types, not just FLOAT4), will bypass preprocessing and operate directly on the raw features in the array. - Passing a Postgres row type which is differentiated by
()
instead ofARRAY[]
.
The syntax distinction is subtle enough that people may overlook it, and call the wrong version. I'm curious if this was biting you, or just the areas in the notebooks that were not explicitly cast to FLOAT4?
from postgresml.
Thanks for the update, not just the stuff from the notebooks, thats what I tried in the end to make sure I was not doing something wrong in my own project. I try to run some predictions on my own data but it needs to be preprocessed because of the NULLs and categorical variables. I will try now with the adjusted api.rs hope it works.
from postgresml.
Related Issues (20)
- Disadvantages of backup and recovery HOT 2
- An error occurred: pool timed out while waiting for an open connection
- Train Error HOT 2
- ImportError: cannot import name 'Tensor' from 'torch' (unknown location) HOT 3
- open a new issue since the Train Error #1296 problem has been closed. HOT 1
- Documentation never states the extension name HOT 2
- After using pgml.dump_all and pgml.load_all for data backup and migration, an error occurs when trying to train on the new database.
- Error occurs during pgml.train with passive_aggressive algorithm
- rust-xgboost doesn't build on Mac OS HOT 1
- Improvement Suggestion for predictor_proba Function in sklearn.py HOT 1
- pgml.train does not properly escape relation_name HOT 2
- pgvector missing in Docker image ghcr.io/postgresml/postgresml:2.8.2 HOT 2
- wrong SQL SELECT in readme
- wrong SQL CREATE INDEX in readme
- Error in "pgml.transform" with "text2text-generation" and "bigscience/T0" HOT 2
- Docker build process fails because postgresml version 2.7.13 is no longer found at apt.postgresml.org HOT 2
- Question answering on a table HOT 1
- Support for `mixedbread-ai/mxbai-embed-large-v1`
- Split transformers.py into multiple python files HOT 1
- Potential performance issue: Unreliable performance of .loc in pandas 2.0.3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from postgresml.