Comments (8)
Which version of PostgreSQL are you using?
Can you verify that your postgres encoding is set to UTF8? Check by runnning:
$ psql stg_temp_max_figure -c "SHOW SERVER_ENCODING"
Also, can we see what encoding your database is using? List those by running:
$ psql -l
I suspect the issue is that your database is using an ascii encoding rather than unicode, as it should be. If that hypothesis is correct, you should try to rerun the tutorial after explicitly making sure the database is using unicode by dropping the database and making a new one:
$ createdb -E UTF8 stg_temp_max_figure
$ psql -l
from fonduer.
Thanks for the follow-up @Allen8838! I don't know offhand how to change default encoding, but I do suspect it's based on setting the templates to be UTF8 rather than ASCII.
Also note that Fonduer expects Postgres 9.6 or above, so there may be other issues you run into with using postgres 9.5.13, so I'd recommend upgrading if you can.
from fonduer.
Hi @Allen8838, are you using OSX or Linux? Also, is this the latest tutorials using Fonduer 0.2.3?
from fonduer.
Hi @lukehsiao, I am using Bash on Ubuntu on Windows. Yes, this is the latest tutorial using Fonduer 0.2.3.
from fonduer.
I'm not able to reproduce this on my Ubuntu machine. Can you post the full error log (preferably copy-pasted as text, rather than being a screenshot)?
from fonduer.
Please see below. It's one error log, broken into several snips.
from fonduer.
Thanks for your help. I am using version 9.5.13.
Yea, you're right about the problem. The postgres encoding is set to SQL_ASCII.
I tried the line above in creating a database but still saw that the encoding was SQL_ASCII. In case it may be helpful for others, I ended using this line instead.
! createdb -E UTF8 -T template0 stg_temp_max_figure
Which got the Parser to work.
Would you offhand know how I can change the default encoding to UTF8? I did some digging and saw that I can make the change through initdb but was unable to find this file.
Thanks again.
from fonduer.
Thank you.
from fonduer.
Related Issues (20)
- Extracting Information from tables without Borders HOT 4
- get_sentence_ngrams, get_neighbor_sentence_ngrams, same_sentence should be fonduer.utils.data_model_utils.textual?
- BBox value errors HOT 3
- Is this the right way to test the saved emmental models? HOT 5
- ReadTheDocs error HOT 4
- Featurizer.get_keys() does not honor candidate classes in context
- HTMLDocPreprocessor for PDF documents is it always required HOT 3
- How can i extract a paragraph and all associated sentences in document HOT 1
- Suggestion required: Getting error while applying Featurizer HOT 3
- Parser is not splitting multiple lines sentences properly HOT 3
- unable to read images in the pdf file HOT 8
- Tokens not aligned error when spacy < 2.3.0 HOT 3
- hOCR preprocessor not available in latest release despite documentation suggesting othwerwise HOT 2
- Parser can't handle big tables? HOT 3
- Its dead slow with Win10 + PY 3.6 HOT 2
- HOCRParser fails to multiline Japanese strings HOT 2
- UDF hangs with no exception / warning HOT 5
- Tables aren't redefined for re-runs of UDF apply HOT 5
- Test code "test_postgres.py" failes with sqlalchemy delete method
- CandidateExtractor doesn't scale for larger relations HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fonduer.