Wrong requests to LanguageTool with numbers and dates. #3325

Closed
Niklas123Niklas opened this issue Oct 11, 2023 · 0 comments

Hi there,
I am working with TeXstudio and LanguageTool server 6.2 (not the newly developed software, but the old open-source version that is still available). I ran into a problem with how numbers and dates are handled in the request sent to the LanguageTool server. It seems that numbers are dropped and never sent to the server for checking, which leads to false error messages when writing a date such as 11.10.2023. Instead of the date, only two dots are sent, which results in a double punctuation error. This in turn causes a case error for the following word. I assume the numbers are filtered out. Is it possible to remove the filtering? Or would that create more errors than it solves when processing other text/number combinations?
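A minimal sketch of the suspected behaviour, written from the outside: it assumes the text is split into word, number, and punctuation tokens, that number tokens are dropped, and that punctuation is appended without a leading space. The function name `rebuild_without_numbers` and the tokenizing regex are hypothetical and are not taken from the TeXstudio source; they are only chosen so that the output matches the request body captured further down.

```python
import re
from urllib.parse import quote

# Hypothetical tokenizer: runs of digits, runs of word characters, single punctuation marks.
TOKEN = re.compile(r"\d+|\w+|[^\w\s]")

def rebuild_without_numbers(text: str) -> str:
    """Rebuild the text while skipping number tokens (assumed filtering, not TeXstudio code)."""
    out = ""
    for tok in TOKEN.findall(text):
        if tok.isdigit():
            continue  # the number is dropped, the dots around it survive
        if tok[0].isalnum() and out:
            out += " "  # words are separated by a single space
        out += tok  # punctuation is attached directly to what precedes it
    return out

sentence = ("Dies ist ein Text der das Datum 10.12.2023 enthält "
            "und fälschlicherweise Fehler anzeigt.")
filtered = rebuild_without_numbers(sentence)
body = "language=de-DE&text=" + quote(filtered)
print(filtered)
print(body)
```

Under these assumptions the date collapses to `..` directly after `Datum`, which is the input that would trigger a double punctuation match followed by a sentence-start case match.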

Environment

  • TeXstudio: 4.6.3
  • Qt: 6.5.2
  • OS: Windows 11
  • TeX distribution: texlive

Expected behavior

Send the complete sentence, including numbers, to LanguageTool and don't show errors for double punctuation and case at the beginning of a sentence.

Actual behavior

Errors are shown for double punctuation and for case at the beginning of a sentence.
I captured the packets sent from TeXstudio to LanguageTool with Wireshark, in case this helps:
`Request:
POST /v2/check HTTP/1.1
Host: localhost:8081
Content-Type: text/json
Content-Length: 132
Connection: Keep-Alive
Accept-Encoding: zstd, br, gzip, deflate
Accept-Language: de-DE,en,*
User-Agent: Mozilla/5.0

language=de-DE&text=Dies%20ist%20ein%20Text%20der%20das%20Datum..%20enth%C3%A4lt%20und%20f%C3%A4lschlicherweise%20Fehler%20anzeigt.

Answer:
{"software":{"name":"LanguageTool","version":"6.2","buildDate":"2023-07-02 13:18:32 +0000","apiVersion":1,"premium":false,"premiumHint":"You might be missing errors only the Premium version can find. Contact us at supportlanguagetoolplus.com.","status":""},"warnings":{"incompleteResults":false},"language":{"name":"German (Germany)","code":"de-DE","detectedLanguage":{"name":"German (Germany)","code":"de-DE","confidence":0.9999979,"source":"+fallback"}},"matches":[{"message":"Zwei aufeinander folgende Punkte. Auch wenn ein Satz mit einer Abkürzung endet, endet er nur mit einem Punkt (§103 Regelwerk).","shortMessage":"Zwei aufeinanderfolgende Punkte","replacements":[{"value":"."},{"value":"…"}],"offset":31,"length":2,"context":{"text":"Dies ist ein Text der das Datum.. enthält und fälschlicherweise Fehler an...","offset":31,"length":2},"sentence":"Dies ist ein Text der das Datum..","type":{"typeName":"Other"},"rule":{"id":"DE_DOUBLE_PUNCTUATION","description":"Zwei aufeinanderfolgende Kommas oder Punkte","issueType":"typographical","urls":[{"value":"https://dict.leo.org/grammatik/deutsch/Rechtschreibung/Amtlich/Interpunktion/pgf101-105.html#grammarpgf103"}],"category":{"id":"PUNCTUATION","name":"Zeichensetzung"}},"ignoreForIncompleteSentence":false,"contextForSureMatch":0},{"message":"Dieser Satz fängt nicht mit einem großgeschriebenen Wort an.","shortMessage":"","replacements":[{"value":"Enthält"}],"offset":34,"length":7,"context":{"text":"Dies ist ein Text der das Datum.. enthält und fälschlicherweise Fehler anzeigt. 
","offset":34,"length":7},"sentence":"enthält und fälschlicherweise Fehler anzeigt.","type":{"typeName":"Other"},"rule":{"id":"UPPERCASE_SENTENCE_START","description":"Großschreibung am Satzanfang","issueType":"typographical","urls":[{"value":"https://languagetool.org/insights/de/beitrag/gross-klein-schreibung-rechtschreibung/#1-satzanf%C3%A4nge-schreiben-wir-gro%C3%9F"}],"category":{"id":"CASING","name":"Groß-/Kleinschreibung"}},"ignoreForIncompleteSentence":true,"contextForSureMatch":-1}],"sentenceRanges":[[0,33],[34,79]]}`

How to reproduce

Install LanguageTool 6.2 from https://dev.languagetool.org/http-server.html
Start it with java -cp languagetool-server.jar org.languagetool.server.HTTPServer --port 8081 --allow-origin
Check a German sentence that contains a date, for example: Dies ist ein Text der das Datum 10.12.2023 enthält und fälschlicherweise Fehler anzeigt.
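To narrow down whether the server or the editor is at fault, the unmodified sentence can also be sent to the server directly. The following is a rough sketch, assuming the server from the steps above is listening on localhost:8081; the helper names `build_body` and `rule_ids` are made up for this example. It posts the `language` and `text` form parameters to `/v2/check` and lists the IDs of the rules that matched.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Assumption: local server started as in the steps above, on port 8081.
SERVER = "http://localhost:8081/v2/check"
SENTENCE = ("Dies ist ein Text der das Datum 10.12.2023 enthält "
            "und fälschlicherweise Fehler anzeigt.")

def build_body(text: str, language: str = "de-DE") -> bytes:
    """Form-encode the two parameters, keeping the digits in the text."""
    return urlencode({"language": language, "text": text}).encode("ascii")

def rule_ids(text: str, server: str = SERVER) -> list:
    """POST the text to the server and return the IDs of all matched rules."""
    req = urllib.request.Request(server, data=build_body(text), method="POST")
    with urllib.request.urlopen(req, timeout=10) as resp:
        result = json.load(resp)
    return [match["rule"]["id"] for match in result["matches"]]

if __name__ == "__main__":
    try:
        print(rule_ids(SENTENCE))
    except OSError as exc:
        print(f"Could not reach the LanguageTool server: {exc}")
```

If `DE_DOUBLE_PUNCTUATION` and `UPPERCASE_SENTENCE_START` do not appear in the printed list, that would suggest the server copes with the full sentence and the false positives come from the text TeXstudio sends.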
