Comments (1)
Normalized Code is provided here:
https://github.com/microsoft/CodeXGLUE/blob/main/Text-Code/NL-code-search-Adv/dataset.zip.
You can find the normalized code in "function_tokens" field of test_code.jsonl
. I list one line here:
{"url": "https://github.com/soimort/you-get/blob/b746ac01c9f39de94cac2d56f665285b0523b974/src/you_get/extractors/miomio.py#L41-L51", "sha": "b746ac01c9f39de94cac2d56f665285b0523b974", "docstring_summary": "str->list\n Convert XML to URL List.\n From Biligrab.", "language": "python", "parameters": "(xml_data)", "return_statement": "return rawurl", "argument_list": "", "function_tokens": ["def", "Func", "(", "arg_0", ")", ":", "arg_1", "=", "[", "]", "arg_2", "=", "parseString", "(", "arg_0", ")", "for", "arg_3", "in", "arg_2", ".", "getElementsByTagName", "(", "'durl'", ")", ":", "arg_4", "=", "arg_3", ".", "getElementsByTagName", "(", "'url'", ")", "[", "0", "]", "arg_1", ".", "append", "(", "arg_4", ".", "childNodes", "[", "0", "]", ".", "data", ")", "return", "arg_1"], "function": "def Func(arg_0):\n \"\"\"str->list\n Convert XML to URL List.\n From Biligrab.\n \"\"\"\n arg_1 = []\n arg_2 = parseString(arg_0)\n for arg_3 in arg_2.getElementsByTagName('durl'):\n arg_4 = arg_3.getElementsByTagName('url')[0]\n arg_1.append(arg_4.childNodes[0].data)\n return arg_1", "path": "src/you_get/extractors/miomio.py", "identifier": "sina_xml_to_url_list", "docstring": "str->list\n Convert XML to URL List.\n From Biligrab.", "docstring_tokens": ["str", "-", ">", "list", "Convert", "XML", "to", "URL", "List", ".", "From", "Biligrab", "."], "nwo": "soimort/you-get", "score": 0.9997601519430084}
from codexglue.
Related Issues (20)
- How to complete in real time by CodeCompletion?
- Is there a python library to use code BLEU? HOT 1
- 403 Forbidden error for Code-To-Text data files HOT 6
- Question related to fine tuning pretrained models for Defect-Detection task
- clone detection reproduction,CodeBert pipeline MAP@R only 76.64
- The CodeBlue evaluation script about code-to-code translation
- About CodeBLEU
- not a mach-o file when run code bleu
- javascript keys for CodeBLEU HOT 1
- [Code Completion - Token level] About eval_acc function
- [codecompletion-token] split function in code/dataset.py
- Missing data in ConCode HOT 2
- this line is wrong HOT 2
- The mlm and mlm_probability arguments in the run.py are not effective.
- Save_total_limit argument not used in run.py
- When resuming from a saved checkpoint, the train_dataloader doesn't resume from the same saved step.
- idx_file.txt is not effectively updated with the current epoch.
- Convert gradient accumulation with Accelerate
- no test file of webquery_predictions
- CloseTesting answer dataset is empty
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from codexglue.