GithubHelp home page GithubHelp logo

ribuluolwj / chatgpt-for-translation Goto Github PK

View Code? Open in Web Editor NEW

This project forked from raychanan/chatgpt-for-translation

0.0 0.0 0.0 29.62 MB

Use Python and ChatGPT for translation. 利用Python和ChatGPT完成翻译。

License: MIT License

Python 21.98% HTML 78.02%

chatgpt-for-translation's Introduction

ChatGPT for Translation | ChatGPT用于翻译

This is a simple tool that uses ChatGPT to translate text into a specified target language in a faithful way to the original. The tool takes a text file (.pdf, .txt, .md, .html and .rtf) or a folder of text files as input, and outputs a translated text file or a bilingual text file with the original and translated text side by side. Special optimization has been done especially for academic paper PDF parsing and translation.

使用ChatGPT将文本以忠于原文的方式翻译成指定的目标语言。该工具接受一个文本文件(.pdf, .txt, .md, .html.rtf)或者一个包含文本的文件夹,并生成一个直接翻译后的文本或一个双语的(并列显示原始文本和翻译文本)文本。尤其对于学术论文 PDF 解析和翻译做了特别的优化。

Use this on Google Colab (recommended). See here

Google Colab上使用这个工具(推荐)。见这里

Note: Use absolute paths instead of relative paths to process PDFs.

注意:处理 PDF 的时候使用绝对路径而不是相对路径。

Simple Example | 简单例子

git clone https://github.com/Raychanan/ChatGPT-for-Translation.git
cd ./ChatGPT-for-Translation/
pip install -r requirements.txt --quiet

python ChatGPT-translate.py --input_path=input.txt --openai_key=password

This command will translate the text in input.txt into simplified Chinese using ChatGPT. You can also specify any language you want. For example, --target_language="Japanese". See this txt as an example.

这个命令将使用ChatGPT把input.txt中的文本翻译成简体中文。你也可以指定任何你想要的语言。例如,--target_language="Japanese"。翻译后的txt文件例子见这里

Translate Folder Files | 翻译文件夹内所有的文本文件

python ChatGPT-translate.py --input_path=./folder/ --openai_key=password

Bilingual Translation Example | 双语翻译例子

python ChatGPT-translate.py --bilingual --input_path=input.txt --openai_key=password

This command will translate the text in input.txt into Simplified Chinese using ChatGPT, using 20 threads (10 by default) for translation. The output will be a bilingual text file (both txt and Word files will be generated) with the original and translated text side by side. See this txt as an example.

这个命令将使用ChatGPT把input.txt中的文本翻译成简体中文,使用20个线程(默认为5个)进行翻译。输出结果将是一个双语文本文件(会同时生成txt和Word文档),其中并列着原始文本和翻译文本。双语翻译后的txt文件例子见这里

Prerequisites | 要求

You need a OpenAI API key (https://beta.openai.com/signup/)

你需要一个OpenAI的API密钥(https://beta.openai.com/signup/)

Arguments | 可用参数

--num_threads: The number of threads to use for translation (default: 5).
--only_process_this_file_extension For example, set only_process_this_file_extension="txt"
--not_to_translate_people_names Whether or not to translate names in the text. This can be useful if you are translating academic texts. By default, names will be translated.
--not_to_translate_references By default, not to translate references.
--keep_first_two_paragraphs Keep the first three paragraphs of the original text. By default, false.

Acknowledge

PDF parser is based on scipdf project on Github. Some adjustments were done to allow users to parse PDFs without having to initialzing a server locally.

chatgpt-for-translation's People

Contributors

raychanan avatar peterdavehello avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.