Building a Large Japanese Web Corpus for Large Language Models

arxiv.org

76 points · PaulHoule · 20 days ago


29 comments
xrd · 20 days ago
LeoPanthera · 20 days ago
I find ChatGPT (4) to have excellent Japanese skills already. It seems to produce more accurate and certainly more idiomatic translations than Google Translate does, and can even explain its own translations afterwards.

Show replies