Comments (2)
The X indicates that it is the values, the Y indicates the labels. So for example, the first entry in "something_X" represents a piece of code, which consists of several code tokens, which are represented each with their vector representation. So the first entry in "something_X" is a list of vectors.
The first entry in "something_Y" is the label, a binary number, indicating whether the aforementioned code snippet contains a vulnerability or not.
The datasets in this form are used to evaluate the LSTM model, which is why they are called "finaltest", as opposed to "train" and "validate" (which are used for initial training, and for the testing that was done to fine-tune the hyperparameters).
They are for example used in trymodel.py, where the datasets are loaded and the LSTM model is used to make predictions, and then those predictions are compared to the actual labels to see how well the model is doing its job.
I hope this helps :)
from vulnerabilitydetection.
I fully understand! Thank you for your detailed guidance:)
from vulnerabilitydetection.
Related Issues (15)
- About w2v_pythoncorpus.py HOT 2
- A question on makemodel.py HOT 4
- About hightlighting the vulnerable code snippets HOT 5
- About labeling HOT 3
- Error while testing model HOT 1
- about w2v_pythoncorpus.py HOT 2
- requirements.txt HOT 13
- Nothing HOT 3
- Common samples within training and test set
- The output of example is different
- The output of example is different
- 漏洞数据集 HOT 1
- Cannot load pretrained Word2vec model
- pydriller version
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vulnerabilitydetection.