Comments (6)
@JonoYang This is highly speculative, but at first blush it looks like the file count is off, perhaps reminiscent of the missing files issue we experienced a few weeks back. As I recall, that had something to do with how the codebases for the pair of scans were defined.
Is that a fair description of that cause? And is there a chance that your eCos
scans bear some similarity to that earlier set of scans?
from deltacode.
Yes something weird is going on here; I will take a look.
from deltacode.
Looks like there is some discrepancy between files_count
and len(index)
for some scans.
Still hammering down the details but we atleast have some initial tests written that reproduce this behavior.
At first glance, it looks like there is some confusion during the index process that fails to index paths that have been aligned to ''
from deltacode.
This happens after scan alignment.
from deltacode.
Ok, have figured out the main cause: We are experience hash collisions during our file indexing.
This was expected for things like sha1
indexing etc, but I underestimated that it could happen to path as well (especially since we align_scan
etc during the delta).
So, this fix will require a bit more work than anticipated, but handling it will allow us to tackle other problems easier (moved files etc). We would have had to make this change at some point, so we are not in a bad place.
from deltacode.
Fixed with #18
from deltacode.
Related Issues (20)
- Add limited delta stats after running HOT 1
- Upgrade DeltaCode to Python 3 HOT 4
- Create DeltaCode documentation on ReadTheDocs HOT 6
- Linux and MacOs buid is showing some warnings in TravisCI HOT 3
- configure failed,why? HOT 10
- Adding Azure Pipelines HOT 1
- In the output content format ‘[ ’how to understand? HOT 2
- Configure failed with "file setup.py not found" HOT 11
- Azure Piplines seems to be filing for Windows Test Jobs
- Create objects to score scan information HOT 6
- Add function to handle loading 2 codebases. HOT 2
- Remove redundant Scan model HOT 1
- Remove redundant File model
- Add Dockerfile
- Separate csv formatted output in Deltacode
- Merge DeltaCode in ScanCode TK
- Update structure to use the https://github.com/nexB/skeleton
- RFC: DeltaCode next! and roadmap HOT 2
- Update documentation after deltacode gets merge in scancode-toolkit HOT 2
- License detection diffs are incorrect HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deltacode.