Comments (4)
This is an interesting issue.
Currently, our method is fine-tuned for English RoBERTa and T5, since the datasets used in our paper are in English. In fact, RESDSQL is a general method and is not limited by language. Therefore, we also want to know if RESDSQL can achieve promising performance on multilingual Text-to-SQL benchmarks such as CSpider.
You can replace the original English RoBERTa and T5 with multilingual RoBERTa and T5 that support both Chinese and English, and then fine-tune them.
Although it sounds easy, there are still potential problems that may be encountered during implementation. Good luck!
from resdsql.
Thank you very much, that sounds very reasonable!
Training on Chinese-English bilingual datasets has great significance. For example, in a real industry scene our user input is in Chinese, but the database schema is in English. I'll try your proposal idea.
I still have a little concern about preprocessing.py
, does it bind with English?
from resdsql.
There are two potential problems.
- First, see lines 353-360 in
preprocessing.py
, where we follow PICARD and use string matching to retrieve some useful values from the database based on the given English question. This step facilitates the generation of predicates in SQL queries (e.g.,WHERE country = 'France'
). However, this function may not work when you input Chinese questions. - Second, since the most powerful model
RESDSQL-3B+NatSQL
uses NatSQL (an intermediate representation of SQL aligned to the Spider dataset) as the seq2seq training target, I do not know if the NatSQL dataset can also be aligned to the CSpider dataset. If not, you can only fine-tune a SQL version of RESDSQL, i.e., RESDSQL-{Base, Large, 3B} in our paper.
from resdsql.
Thanks a lot, I'll try it out.
from resdsql.
Related Issues (20)
- Execuse me. What happened to paper CodeS? Isn't this article open source before? HOT 9
- Low training metrics HOT 14
- Support for Historical Conversation in RESDSQL HOT 4
- Question about evaluation scripts HOT 2
- 请问推理方法 HOT 2
- 最低支持的GPU内存是多少,我怎么跑不起来。
- Dev result file?
- 部分带有别名的sql在经过normalization处理后出现错误 HOT 2
- Inference script not working HOT 5
- CoSQL HOT 1
- 训练Cross-Encoder的时候为什么24G的显存还不够用? HOT 1
- 关于RESDSQL在BIRD上的运行时间 HOT 2
- Training cross-coder error HOT 1
- xlm_roberta_text2natsql_schema_item_classifier HOT 3
- Evaluation detail on CSpider HOT 1
- 你好,请问如何将自己的数据集处理成CSpider的形式? HOT 3
- 你好,请问如何SQL2NatSQL?我想用自己的数据集跑text2NatSQL的方法。 HOT 2
- 请问模型训练有多gpu并行支持吗 HOT 1
- Can the ranking-filter successfully choose all the right schema items? HOT 1
- 为什么我使用对bird训练的classifier时出现了truncated_dataset.json文件,而且陷入了循环无法结束运行 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from resdsql.