---
license: apache-2.0
datasets:
- wikisql
language:
- en
pipeline_tag: text2text-generation
tags:
- nl2sql
widget:
- text: "question: get people name with age less 25 table: id, name, age"
  example_title: "less than"
- text: "question: get people name with age equal 25 table: id, name, age"
  example_title: "equal"
---

New version: [LarkAI/codet5p-770m_nl2sql_oig](https://huggingface.co/LarkAI/codet5p-770m_nl2sql_oig), which uses the oig-sql dataset and supports more complex SQL parsing.

# How to Use

```python
import torch
from transformers import AutoTokenizer, BartForConditionalGeneration

# Use the GPU when one is available, otherwise fall back to the CPU
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

tokenizer = AutoTokenizer.from_pretrained("LarkAI/bart_large_nl2sql")
model = BartForConditionalGeneration.from_pretrained("LarkAI/bart_large_nl2sql").to(device)

# Input format: "question: <question> table: <comma-separated column names>"
text = "question: get people name with age less 25 table: id, name, age"
inputs = tokenizer([text], max_length=1024, truncation=True, return_tensors="pt")
# num_beams=4 is an example beam width; adjust it as needed
output_ids = model.generate(inputs["input_ids"].to(device), num_beams=4, max_length=128, min_length=8)
response_text = tokenizer.batch_decode(output_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False)[0]
print(response_text)
# SELECT name FROM table WHERE age < 25
```

Reference: [juierror/flan-t5-text2sql-with-schema](https://huggingface.co/juierror/flan-t5-text2sql-with-schema). This model fixes the issue raised in this [discussion](https://huggingface.co/juierror/flan-t5-text2sql-with-schema/discussions/5).
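
The example input follows the pattern `question: <question> table: <comma-separated column names>`. A small helper along these lines can build that string from a question and a column list. This is only a sketch: `build_prompt` is a hypothetical name, not part of the model repository, and the format is inferred from the example input rather than taken from the training code.

```python
from typing import Sequence


def build_prompt(question: str, columns: Sequence[str]) -> str:
    """Build an input string in the "question: ... table: ..." format.

    Hypothetical helper: the format is inferred from the example input,
    not taken from the model's training code.
    """
    if not columns:
        raise ValueError("at least one column name is required")
    # Collapse runs of whitespace so the prompt stays on a single line
    clean_question = " ".join(question.split())
    # Trim each column name and join them with ", " as in the example input
    column_list = ", ".join(name.strip() for name in columns)
    return f"question: {clean_question} table: {column_list}"


prompt = build_prompt("get people name with age less 25", ["id", "name", "age"])
print(prompt)
# question: get people name with age less 25 table: id, name, age
```

The returned string can be passed to the tokenizer in place of a hand-written `text` value.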

# How to Train

Quick start: see the Hugging Face Transformers [summarization example](https://github.com/huggingface/transformers/blob/main/examples/pytorch/summarization/README.md).
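
The summarization example can train from JSON lines files that hold one input field and one target field per row. As a rough sketch, WikiSQL-style records could be converted into that shape as follows. Several things here are assumptions: the record layout (`question`, `table["header"]`, `sql["human_readable"]`) follows the `wikisql` dataset on the Hugging Face Hub, the field names `text` and `summary` and the function names are illustrative choices, and this is not confirmed to be the preprocessing used for this model.

```python
import json
from typing import Dict, Iterable


def to_example(record: Dict) -> Dict[str, str]:
    """Map one WikiSQL-style record to an input/target pair.

    Assumed record layout: record["question"], record["table"]["header"],
    record["sql"]["human_readable"].
    """
    columns = ", ".join(record["table"]["header"])
    return {
        # Same "question: ... table: ..." format as the inference example
        "text": f"question: {record['question']} table: {columns}",
        # The SQL string is the generation target
        "summary": record["sql"]["human_readable"],
    }


def to_json_lines(records: Iterable[Dict]) -> str:
    """Serialize records as JSON lines, one training example per line."""
    return "\n".join(json.dumps(to_example(r), ensure_ascii=False) for r in records)


# Hand-written sample record, used only for illustration
sample = [{
    "question": "get people name with age less 25",
    "table": {"header": ["id", "name", "age"]},
    "sql": {"human_readable": "SELECT name FROM table WHERE age < 25"},
}]
print(to_json_lines(sample))
```

A file written this way could then be passed to `run_summarization.py` with `--train_file`, together with `--text_column text` and `--summary_column summary`.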