Comments (5)
Thanks for finding the bug. I found the problem but may take a day or two to fix it.
from ox.
Okay, maybe sooner than a day or two. Release 1.6.6 just pushed with the fix.
from ox.
fixed, thanks
One more thing I noticed, if the orginal string has a real encoding (ascii, utf-8) as opposed to no encoding (ascii-8bit) and there is no xml declaration (with encoding), the parser yields ascii-8bit nodes.
ascii-8bit results are pretty useless, you need to know how to interpret them, since that info is available in the source (whatever is coming out of the io), I think it makes sense to preserve its encoding. ascii-8bit should be yielded only from ascii-8bit origin with no xml encoding declaration.
from ox.
There is a default encoding setting that will be used if none is specified. That is set in the default options. I could make the input string encoding be used if no default option is set and no encoding is specified in the xml. That seems reasonable. Look for it in the next release.
from ox.
Release 1.6.7 make the default the input string encoding if no default option encoding is set or the encoding is not specified in the XML prolog.
from ox.
Related Issues (20)
- Sax parser segfault in v2.14.7+ HOT 4
- `StringIO` in `Ox::Builder` HOT 2
- Sax parsing with default encoding set to UTF-8 breaks Ox::Sax#text HOT 2
- OpenSSL::SSL::SSLContext::DEFAULT_CERT_STORE is not shareable across ractors HOT 1
- Issue with gems on M1 that depends on ox HOT 3
- test/tests.rb fails with ruby3.2.0preview3 and onwards HOT 3
- Ruby 3.2 compatibility HOT 2
- cannot load such file -- ox/ox HOT 6
- Cleaning built artifacts after installation breaks macOS HOT 4
- Incorrect ` Start End Mismatch` exception in sax parser HOT 9
- Ruby 2.7.6 LoadError: cannot load such file -- ox/ox HOT 3
- segfault in 2.14.13 -- free(): invalid pointer Aborted (core dumped) HOT 7
- Verbose output of inspect HOT 4
- Segmentation fault in sax_html since 2.14.13 HOT 12
- 2.14.15 cant be installed on windows HOT 1
- Segfault when parsing deeply nested HTML HOT 5
- `Start End Mismatch: element X not closed` errors in sax_test HOT 2
- Ox.load fails with UTF-8 characters in xml element names HOT 4
- ? in unquoted attribute value causes HTML to not be properly parsed HOT 4
- a new mode like :hash_no_attrs but with included attributes HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ox.