GithubHelp home page GithubHelp logo

Comments (5)

Cyan4973 avatar Cyan4973 commented on April 28, 2024 1

Then what's exactly kind of case where zstd dictionary compression can actually stand out? Can I say, it's the case of many small files but don't allow to be concatenated?

Yes. It's a case where random access is required, or there is no time or memory to allow concatenation.

These cases are more common in data centers than on personal computers.

from zstd.

Cyan4973 avatar Cyan4973 commented on April 28, 2024 1

lz4 already supports it.

zlib should also support it, the interface may be less obvious though.

No idea about snappy.

from zstd.

Cyan4973 avatar Cyan4973 commented on April 28, 2024

This is highly unlikely.

There might be ways to manufacture a case to explicitly advantage the dictionary compression method, but even in this case, such a difference would end up being unimpressive.

Sort of such cheating technique, I would expect the more natural "stream the whole file" method to win every time, and sometimes by very significant margins.

from zstd.

Nor7th avatar Nor7th commented on April 28, 2024

@Cyan4973 Thanks for the very clear conclusion.

Given many files to compress, it's usually understandable that to compress the concatenating file of all can get a better compression effectiveness than compress each file one by one.

https://stackoverflow.com/questions/48145313/how-is-concatenating-files-before-compressing-more-effective-than-just-compressi

Then what's exactly kind of case where zstd dictionary compression can actually stand out? Can I say, it's the case of many small files but don't allow to be concatenated?

from zstd.

Nor7th avatar Nor7th commented on April 28, 2024

@Cyan4973 BTW, from your view, how difficult it is to expand the dictionary mechanism into other compression algorithms like snappy, lz4, gzip, zlib?

from zstd.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.