GithubHelp home page GithubHelp logo

GzDecoder seem decode incorrect about libflate HOT 9 CLOSED

sile avatar sile commented on June 16, 2024
GzDecoder seem decode incorrect

from libflate.

Comments (9)

axetroy avatar axetroy commented on June 16, 2024 1

Here is the source code https://github.com/axetroy/cask.rs/blob/main/src/extractor.rs

I tried to give a minimum implementation for reproduce

from libflate.

sile avatar sile commented on June 16, 2024

Thank you for reporting this issue.
However, I could not reproduce your problem.

I wrote the following code that deflates the above input file then prints the original size:

fn main() -> anyhow::Result<()> {
    let tar_file = std::fs::File::open("prune_darwin_amd64.tar.gz")?;
    let mut input = libflate::gzip::Decoder::new(&tar_file)?;
    let mut output = Vec::new();
    std::io::copy(&mut input, &mut output)?;

    println!("Deflated size: {}", output.len());
    Ok(())
}

// The comment below is the output of this command.
// Deflated size: 968192

Then, I applied gzip command to deflate the file then confirmed the result file size by using ls:

$ gzip -d prune_darwin_amd64.tar.gz
$ ls -l
-rw-r--r-- 1 user user 968192 Feb 28 02:19 prune_darwin_amd64.tar

So the two results seem matched.

from libflate.

sile avatar sile commented on June 16, 2024

BTW, I could not run the code snippet you shared as I don't know where the Archive struct (or enum?) comes from.

from libflate.

axetroy avatar axetroy commented on June 16, 2024

@sile Hello, Thanks for your help.

And here is the reproduced repo: https://github.com/axetroy/libflate-66

git clone https://github.com/axetroy/libflate-66
cd ./libflate-66
cargo run ./

# View unzipped files ’pruneβ€˜

from libflate.

sile avatar sile commented on June 16, 2024

Thank you for the additional information. I could reproduce your result.

Then, I modified the code to use the file ("prune_darwin_amd64.tar") directly deflated by gzip command as follows:

fn main() -> anyhow::Result<()> {
    let extract_file_name = "prune";
    let output_file_path = "output";
    let mut archive = Archive::new(std::fs::File::open("prune_darwin_amd64.tar")?);

    archive.set_unpack_xattrs(true);
    archive.set_overwrite(true);
    archive.set_preserve_permissions(true);
    archive.set_preserve_mtime(true);

    let files = archive.entries()?;

    for entry in files {
        let mut file = entry?;

        let file_path = file.path()?;

        if let Some(file_name) = file_path.file_name() {
            dbg!(&file_name);
            if file_name.to_str().unwrap() == extract_file_name {
                file.unpack(&output_file_path)?;
                print!("unpacked");
                break;
            }
        }
    }
    Ok(())
}

The result was unchanged (i.e., the unpacked file size was 965,416). Thus this is not a problem relevant to libflate I think.

from libflate.

axetroy avatar axetroy commented on June 16, 2024

This makes me confused

I use GZ to compress the file, then decompress it, the result is correct.
I use the TAR archive, then decompress it, the result is correct.

But I combined them, the result is incorrect.

But installing the unzipping tools installed on my computer, everything is fine

from libflate.

sile avatar sile commented on June 16, 2024

FYI, Python3's tar library could handle the (already deflated) input tar file correctly.

>>> import tarfile
>>> tar = tarfile.open("prune_darwin_amd64.tar")
>>> tar.getmember("prune").size
985384

from libflate.

axetroy avatar axetroy commented on June 16, 2024

OK, it should be the difference between TAR implementation.

Thanks for your help and time. Have a good day πŸ‘

from libflate.

sile avatar sile commented on June 16, 2024

πŸ‘

from libflate.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.