Comments (6)
The haven and sas7bdat packages currently do not support binary compression; see here for a discussion and possible workaround:
from haven.
Thanks Evan for letting me know.
from haven.
FYI, after much travail getting Java issues resolved (library sas7bdat.parso needs Java 7 and also need a Java SDK, I was able to get things to work. It would be nice not to have to go through all these steps.
from haven.
The sas7bdat.parso library wraps the "Parso" Java library. The R interface depends on the rJava package, which imposes the dependencies that you mention. Thus, the difficulty in installing sas7bdat.parso is primarily due to the difficulty in installing rJava, which is partly why I haven't uploaded this package to CRAN (I also don't want to maintain another package, especially with an rJava dependency).
However, you've highlighted a important point, that the Parso library reads both "CHAR", and "BINARY" compressed sas7bdat files. As far as I know, Parso is the only open source package that implements the latter. In fact, the details of "BINARY" compression is a major hole in the sas7bdat documentation (https://github.com/BioStatMatt/sas7bdat/blob/master/vignettes/sas7bdat.rst). Of course, all that is necessary to fill this hole is to read through the Parso source code and write down what it does with "BINARY" compressed files. I have plans to do that, but I would gratefully accept the help of anyone willing to tackle this problem.
from haven.
It's worth noting that the Parso library is dual-licensed under GPLv3 and an "ask us" commercial license. That essentially makes it toxic for ReadStat, which is MIT-licensed.
IANAL but if @BioStatMatt reads the Parso source code it may force his own code into GPLv3. You can't copyright facts, so the legal status of the vignette should remain the same, however.
from haven.
It looks like the Python sas7bdat package also supports binary compression, with the added benefit that it's MIT licensed:
https://pypi.python.org/pypi/sas7bdat
from haven.
Related Issues (20)
- bug while reading sas7bdat file HOT 4
- Vulnerability in SAS7BCAT reader HOT 2
- Progress Bar for XPT HOT 1
- Returning full variable labels HOT 1
- New feature proposal: Reads creation/modified time of SAS7BDAT HOT 3
- Unable to allocate memory when opening a dta file HOT 1
- Upkeep for haven (2023)
- Use standard argument checkers
- `write_dta()` can error if labelled values are large enough
- Zero-length label worse than null label
- Remind the novice user of labelled-format? HOT 2
- Dates in labelled_spss HOT 3
- Release haven 2.5.4
- Allow `labels` to be passed through to `labelled()` as unnamed vector of same length as `x`
- Dataset label truncated after `write_xpt` HOT 5
- `read_xpt()` fails on misconstructed dates HOT 3
- typo in docs for as_factor
- read_sav default is to NOT encode user defined data
- `haven::read_sas` generates some message
- How to solve non-ascii output for `write_xpt` HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from haven.