HTML export handles some UTF characters wrong #1484

Closed
ThomasA opened this issue Jun 8, 2016 · 5 comments · Fixed by #1545

Comments

@ThomasA
Member

ThomasA commented Jun 8, 2016

JabRef 3.4 on Ubuntu 16.04

Steps to reproduce:

  1. Copy and paste the following test entry into a new database in JabRef.
  2. Make sure the database is stored as UTF-8 encoded (should be default).
  3. Export the database and select the file type 'HTML (*.html)'
  4. In the resulting HTML file, the character 'Ø' is displayed incorrectly in Firefox.
    [Screenshots: jabref, firefox]

This does not happen when exporting to 'Simple HTML', 'HTML list', 'HTML table', or 'HTML table (with Abstract and BibTeX)'.

@lenhard
Member

lenhard commented Jun 8, 2016

Thanks for the report!

@oscargus: What do you think about it?

@stefan-kolb stefan-kolb added the bug Confirmed bugs or reports that are very likely to be bugs label Jun 8, 2016
@oscargus
Contributor

Sorry for the late reply. My guess is that the export filter, for some reason, does not run the Unicode to HTML converter, since it works for the other HTML variants. I will not have time to look into it further this week.

@Siedlerchr
Member

I tested around a bit, and a simple workaround would be to set the charset in the HTML documents. When none is set, the browser does not assume UTF-8 and falls back to a default legacy encoding (at least in Firefox).
For UTF-8 it would be enough to add:
<meta charset="utf-8" />
directly after the opening <html> tag.
Then no escaping is necessary.

Apart from that, it seems that no formatter is run on export.
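
To illustrate the workaround above, here is a minimal Python sketch (not JabRef code). The first part shows what happens when the UTF-8 bytes of 'Ø' are read with a single-byte encoding; `cp1252` is an assumption standing in for whatever fallback the browser picks. The second part is a hypothetical helper, `add_charset_meta`, that inserts the declaration near the top of an already exported file.

```python
import re

# 'Ø' (U+00D8) is encoded as two bytes in UTF-8.
utf8_bytes = "Ø".encode("utf-8")  # b'\xc3\x98'

# Reading those two bytes with a single-byte encoding yields two wrong characters.
misread = utf8_bytes.decode("cp1252")  # 'Ã˜'


def add_charset_meta(html_text: str, charset: str = "utf-8") -> str:
    """Insert a <meta charset> after the first <head> tag, or after <html> if there is no <head>."""
    meta = f'<meta charset="{charset}" />'
    for tag in ("head", "html"):
        match = re.search(rf"<{tag}\b[^>]*>", html_text, flags=re.IGNORECASE)
        if match:
            return html_text[:match.end()] + "\n" + meta + html_text[match.end():]
    # Neither tag found: put the declaration at the very top.
    return meta + "\n" + html_text
```

With the declaration in place, the browser decodes the same bytes as UTF-8 and the character is shown correctly without any escaping.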

@oscargus
Contributor

oscargus commented Jul 4, 2016

I looked into this a bit more, and @Siedlerchr is completely correct: what is missing is the charset declaration.

JabRef does not have a Unicode to HTML converter, only Unicode to/from LaTeX and HTML to/from LaTeX.

I'll soon be committing a fix, adding the charset.
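
For comparison, the alternative to declaring the charset would be a Unicode to HTML conversion that escapes every non-ASCII character. The following Python sketch shows the idea using numeric character references; it is not part of JabRef, and `escape_non_ascii` and the sample string are made up for illustration.

```python
def escape_non_ascii(text: str) -> str:
    """Replace every non-ASCII character with a numeric character reference."""
    # The 'xmlcharrefreplace' error handler turns unencodable characters into &#NNN;
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


escaped = escape_non_ascii("Ørsted")  # '&#216;rsted'
```

Output escaped this way is pure ASCII, so it displays correctly regardless of which encoding the browser assumes.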

@oscargus
Contributor

oscargus commented Jul 4, 2016

Ideally one should add the charset actually used for the file. However, this is not possible with the current export functionality as far as I can tell.
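
As a rough illustration of that ideal case, the Python sketch below declares the same encoding that is used to write the bytes. It does not reflect how JabRef's export works; `render_html` is a hypothetical function, and it assumes the encoding name is valid both as a Python codec name and as an HTML charset label.

```python
import html


def render_html(body: str, encoding: str = "utf-8") -> bytes:
    """Return a full HTML page whose declared charset matches its actual bytes."""
    page = (
        "<!DOCTYPE html>\n"
        "<html>\n"
        "<head>\n"
        f'<meta charset="{html.escape(encoding, quote=True)}" />\n'
        "</head>\n"
        f"<body>{body}</body>\n"
        "</html>\n"
    )
    # Encode with the same name that was just declared in the page.
    return page.encode(encoding)


utf8_page = render_html("Ø", "utf-8")
latin1_page = render_html("Ø", "iso-8859-1")
```

Here 'Ø' ends up as two bytes in `utf8_page` and one byte in `latin1_page`, and in both cases the declaration tells the browser how to read it.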
