
Comments (7)

reikoch commented on June 18, 2024

Excellent - it works for my test files!
Thanks a lot!

from readstat.

evanmiller commented on June 18, 2024

Notes for myself:

It appears that a subheader pointer in bar.sas7bdat is pointing to compressed data, but the pointer itself indicates that the data is not compressed. (The byte at offset 5809096 is 0x00, whereas all of the other pointers in the block have the value 0x04.) In a way the issue is the converse of #37. I'm not sure if the compression bit is stored elsewhere or if a workaround is required here.
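To make the check above concrete, here is a minimal Python sketch that decodes a run of subheader pointers and reports the ones whose compression byte differs from the majority. The pointer layout (offset, length, one compression byte, one type byte, then padding; 24 bytes in 64-bit files and 12 bytes in 32-bit files) is an assumption taken from community reverse-engineering notes on the format, not from this thread, and `parse_pointers` and `odd_flags` are hypothetical helper names, not ReadStat functions.

```python
import struct
from collections import Counter
from typing import List, NamedTuple


class SubheaderPointer(NamedTuple):
    offset: int       # position of the subheader within the page
    length: int       # length of the subheader in bytes
    compression: int  # assumed: 0x00 = not compressed, 0x04 = compressed
    kind: int         # subheader type byte


def parse_pointers(buf: bytes, count: int, u64: bool = True,
                   little_endian: bool = True) -> List[SubheaderPointer]:
    """Decode `count` consecutive pointers from `buf` (layout is an assumption)."""
    endian = "<" if little_endian else ">"
    # Two integers, two single bytes, then padding bytes that are skipped.
    fmt = endian + ("QQBB6x" if u64 else "IIBB2x")
    size = struct.calcsize(fmt)
    return [SubheaderPointer(*struct.unpack_from(fmt, buf, i * size))
            for i in range(count)]


def odd_flags(pointers: List[SubheaderPointer]) -> List[int]:
    """Return indexes of pointers whose compression byte is not the majority value."""
    if not pointers:
        return []
    counts = Counter(p.compression for p in pointers)
    majority, _ = counts.most_common(1)[0]
    return [i for i, p in enumerate(pointers) if p.compression != majority]
```

Run against the pointer block of a page, a pointer like the one at offset 5809096 would show up as the only index returned by `odd_flags`.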


reikoch commented on June 18, 2024

If it helps, I can produce a genuinely char-compressed dataset, not just one labelled as such. I see problems with char-compressed datasets so often that I do not believe the issue is restricted to "wrong" labelling.


evanmiller commented on June 18, 2024

The data you provided is indeed compressed. The "mislabeling" occurs for a short segment of data deep in the file. The trouble occurs because a compressed file can contain both compressed and uncompressed chunks, and these individual chunks are sometimes mislabeled (apparently).
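One possible workaround for such mislabeled chunks, sketched in Python purely as an illustration: trust the per-chunk flag when it says "compressed", and otherwise fall back on a length check. The reasoning is that a raw row cannot be shorter than the row length, so a shorter chunk in a compressed file is probably compressed despite its flag. This heuristic, the flag values, and the name `chunk_is_compressed` are all assumptions; nothing in this thread confirms that ReadStat handles the case this way.

```python
FLAG_UNCOMPRESSED = 0x00  # assumed flag value, matching the byte seen in the pointer
FLAG_COMPRESSED = 0x04    # assumed flag value used by the other pointers


def chunk_is_compressed(flag: int, chunk_len: int, row_len: int,
                        file_compressed: bool) -> bool:
    """Guess whether a chunk needs decompression (hypothetical heuristic)."""
    if flag == FLAG_COMPRESSED:
        # The pointer says compressed: take it at its word.
        return True
    if file_compressed and flag == FLAG_UNCOMPRESSED and chunk_len < row_len:
        # Too short to be a raw row, so the flag is probably wrong.
        return True
    return False
```

A chunk flagged 0x00 that is exactly one row long is still read as raw data, so correctly labeled uncompressed chunks in a compressed file are unaffected.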


reikoch commented on June 18, 2024

Most char-compressed SAS datasets can now be read by the development version 0.2.0.9000 of haven. However, the simple example ietest2.sas7bdat at [https://github.com/reikoch/testfiles/blob/master/ietest2.sas7bdat] still fails. Calling read_sas("ietest2.sas7bdat") returns:

ReadStat: Error parsing page 0, bytes 8192-16383

Error: Failed to parse /opt/BIOSTAT/home/kochr4/sas7bdat/ietest2.sas7bdat: Invalid file, or file has unsupported features.


reikoch commented on June 18, 2024

It looks like the Python module sas7bdat from [https://bitbucket.org/jaredhobbs/sas7bdat] has solved the decoding of compressed SAS datasets...


evanmiller commented on June 18, 2024

Fixed in c92f697

Please re-open the issue if you are still experiencing problems.

