GithubHelp home page GithubHelp logo

How can I finetune on CSpider about resdsql HOT 4 CLOSED

wikty avatar wikty commented on July 23, 2024
How can I finetune on CSpider

from resdsql.

Comments (4)

lihaoyang-ruc avatar lihaoyang-ruc commented on July 23, 2024

This is an interesting issue.

Currently, our method is fine-tuned for English RoBERTa and T5, since the datasets used in our paper are in English. In fact, RESDSQL is a general method and is not limited by language. Therefore, we also want to know if RESDSQL can achieve promising performance on multilingual Text-to-SQL benchmarks such as CSpider.

You can replace the original English RoBERTa and T5 with multilingual RoBERTa and T5 that support both Chinese and English, and then fine-tune them.

Although it sounds easy, there are still potential problems that may be encountered during implementation. Good luck!

from resdsql.

wikty avatar wikty commented on July 23, 2024

Thank you very much, that sounds very reasonable!

Training on Chinese-English bilingual datasets has great significance. For example, in a real industry scene our user input is in Chinese, but the database schema is in English. I'll try your proposal idea.

I still have a little concern about preprocessing.py, does it bind with English?

from resdsql.

lihaoyang-ruc avatar lihaoyang-ruc commented on July 23, 2024

There are two potential problems.

  • First, see lines 353-360 in preprocessing.py, where we follow PICARD and use string matching to retrieve some useful values from the database based on the given English question. This step facilitates the generation of predicates in SQL queries (e.g., WHERE country = 'France'). However, this function may not work when you input Chinese questions.
  • Second, since the most powerful model RESDSQL-3B+NatSQL uses NatSQL (an intermediate representation of SQL aligned to the Spider dataset) as the seq2seq training target, I do not know if the NatSQL dataset can also be aligned to the CSpider dataset. If not, you can only fine-tune a SQL version of RESDSQL, i.e., RESDSQL-{Base, Large, 3B} in our paper.

from resdsql.

wikty avatar wikty commented on July 23, 2024

Thanks a lot, I'll try it out.

from resdsql.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.