New entry from plain text fails #6517

Closed
1 task done
crystalfp opened this issue May 24, 2020 · 7 comments
Labels
bug (confirmed bugs or reports that are very likely to be bugs), fetcher, status: stale

Comments

@crystalfp

JabRef 5.1--2020-05-23--03e3663
Windows 10 10.0 amd64
Java 14.0.1

The new automatic parser for entries from plain text fails to extract anything from citations copied from papers or websites. Here are two examples:

Is there life after Fortnite? An emerging trend in game design brings novelty
by Pascal Luban on 06/04/19 
<https://gamasutra.com/blogs/PascalLuban/20190604/343440/Is_there_life_after_Fortnite_An_emerging_trend_in_game_design_brings_novelty.php>

Subramony, D., Molenda, M., Betrus, A., and Thalheimer, W. (2014). The Mythical Retention Chart and the Corruption of Dale’s Cone of Experience. Educational Technology, Nov/Dec 2014, 54(6), 6-16.

With the old manual tool, I could at least extract the relevant parts and then correct the record. Would it be possible to bring it back?

Steps to reproduce the behavior:

  1. Copy one of the citations above.
  2. In JabRef, open "New entry from plain text".
  3. Paste the citation and click "Add to current library".
  4. Nothing happens.

Thanks!
mario

@crystalfp
Author

The example from the documentation page https://docs.jabref.org/collect/newentryfromplaintext also fails: it gives "0 entries were parsed from your query" with JabRef 5.1--2020-06-04--7cc5747 (the latest build available today).

To diagnose the problem, I started JabRef.bat --debug from the portable installation. After some errors not directly related to my problem, this is the output I get when I ask JabRef to parse the entry from the example page:

%d [%thread] %-5level %logger - %msg%n%d [%thread] %-5level %logger - %msg%n%d [%thread] %-5level %logger - %msg%n%d [%thread] %-5level %logger - %msg%n%d [%thread] %-5level %logger - %msg%n%d [%thread] %-5level %logger - %msg%n
java.net.ConnectException: Connection refused: connect
        at sun.nio.ch.Net.connect0(Native Method) ~[?:?]
        at sun.nio.ch.Net.connect(Unknown Source) ~[?:?]
        at sun.nio.ch.Net.connect(Unknown Source) ~[?:?]
        at sun.nio.ch.NioSocketImpl.connect(Unknown Source) ~[?:?]
        at java.net.Socket.connect(Unknown Source) ~[?:?]
        at java.net.Socket.connect(Unknown Source) ~[?:?]
        at sun.net.NetworkClient.doConnect(Unknown Source) ~[?:?]
        at sun.net.www.http.HttpClient.openServer(Unknown Source) ~[?:?]
        at sun.net.www.http.HttpClient.openServer(Unknown Source) ~[?:?]
        at sun.net.www.http.HttpClient.<init>(Unknown Source) ~[?:?]
        at sun.net.www.http.HttpClient.New(Unknown Source) ~[?:?]
        at sun.net.www.http.HttpClient.New(Unknown Source) ~[?:?]
        at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(Unknown Source) ~[?:?]
        at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(Unknown Source) ~[?:?]
        at sun.net.www.protocol.http.HttpURLConnection.plainConnect(Unknown Source) ~[?:?]
        at sun.net.www.protocol.http.HttpURLConnection.connect(Unknown Source) ~[?:?]
        at sun.net.www.protocol.http.HttpURLConnection.getOutputStream0(Unknown Source) ~[?:?]
        at sun.net.www.protocol.http.HttpURLConnection.getOutputStream(Unknown Source) ~[?:?]
        at org.jabref.logic.net.URLDownload.openConnection(Unknown Source) ~[org.jabref:?]
        at org.jabref.logic.net.URLDownload.asString(Unknown Source) ~[org.jabref:?]
        at org.jabref.logic.net.URLDownload.asString(Unknown Source) ~[org.jabref:?]
        at org.jabref.logic.importer.util.GrobidService.processCitation(Unknown Source) ~[org.jabref:?]
        at org.jabref.logic.importer.fetcher.GrobidCitationFetcher.parseUsingGrobid(Unknown Source) ~[org.jabref:?]
        at org.jabref.logic.importer.fetcher.GrobidCitationFetcher.lambda$performSearch$1(Unknown Source) ~[org.jabref:?]
        at java.util.stream.ReferencePipeline$3$1.accept(Unknown Source) ~[?:?]
        at java.util.stream.ReferencePipeline$2$1.accept(Unknown Source) ~[?:?]
        at java.util.stream.ReferencePipeline$3$1.accept(Unknown Source) ~[?:?]
        at java.util.Spliterators$ArraySpliterator.forEachRemaining(Unknown Source) ~[?:?]
        at java.util.stream.AbstractPipeline.copyInto(Unknown Source) ~[?:?]
        at java.util.stream.AbstractPipeline.wrapAndCopyInto(Unknown Source) ~[?:?]
        at java.util.stream.ReduceOps$ReduceOp.evaluateSequential(Unknown Source) ~[?:?]
        at java.util.stream.AbstractPipeline.evaluate(Unknown Source) ~[?:?]
        at java.util.stream.ReferencePipeline.collect(Unknown Source) ~[?:?]
        at org.jabref.logic.importer.fetcher.GrobidCitationFetcher.performSearch(Unknown Source) ~[org.jabref:?]
        at org.jabref.gui.bibtexextractor.BibtexExtractorViewModel.lambda$startParsing$0(Unknown Source) ~[org.jabref:?]
        at org.jabref.gui.util.BackgroundTask$1.call(Unknown Source) [org.jabref:?]
        at org.jabref.gui.util.DefaultTaskExecutor$1.call(Unknown Source) [org.jabref:?]
        at javafx.concurrent.Task$TaskCallable.call(Unknown Source) [org.jabref.merged.module:?]
        at java.util.concurrent.FutureTask.run(Unknown Source) [?:?]
        at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source) [?:?]
        at java.util.concurrent.FutureTask.run(Unknown Source) [?:?]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) [?:?]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) [?:?]
        at java.lang.Thread.run(Unknown Source) [?:?]
%d [%thread] %-5level %logger - %msg%n%d [%thread] %-5level %logger - %msg%n 

It seems that Grobid (the parser) tries to connect to a server, but the connection is refused. The documentation does not mention that a network connection is needed. Could it be the Grobid demo server http://cloud.science-miner.com/grobid/? I can open that page from my browser without problems.
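
For readers who want to check reachability outside the browser, here is a minimal sketch of a TCP probe. It is not JabRef code: the class `GrobidProbe` and the method `canConnect` are hypothetical names, and the default host and port in `main` are placeholders, since this thread does not state which server JabRef actually contacts. A refused connection here corresponds to the `java.net.ConnectException` at the top of the stack trace.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

/** Hypothetical helper, not part of JabRef: checks whether a TCP port accepts connections. */
final class GrobidProbe {

    /** Returns true if a TCP connection to host:port succeeds within timeoutMillis. */
    static boolean canConnect(String host, int port, int timeoutMillis) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMillis);
            return true;
        } catch (IOException e) {
            // Covers "Connection refused", connect timeouts, and unresolved host names.
            return false;
        }
    }

    public static void main(String[] args) {
        // Placeholder defaults (assumptions): pass the real host and port as arguments.
        String host = args.length > 0 ? args[0] : "localhost";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 8070;
        System.out.println(host + ":" + port + " reachable: " + canConnect(host, port, 3000));
    }
}
```

A browser reaching a web page does not prove that the application can reach the service: a proxy, a firewall rule, or simply a different host or port can make the two behave differently, so probing the exact host and port the application uses is the more reliable check.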

Thanks for your help!
mario

@Siedlerchr
Member

We have our own GROBID server, but it currently has some issues, which we are investigating. See also #6587 and #6369.

Siedlerchr added the fetcher and bug labels on Jun 7, 2020
nitram509 added a commit to nitram509/jabref that referenced this issue Oct 18, 2020
add information on GROBID server connection issue for user
relates to JabRef#6517 and JabRef#6891
Siedlerchr pushed a commit that referenced this issue Oct 18, 2020
* add explicit configuration for connectTimeout in URLDownload
add information on GROBID server connection issue for user
relates to #6517 and #6891

* add CHANGELOG entry

* fix style issues detected by checkstyle

* use .toMilliseconds() is more convenient
use assertThrow is more convenient

* incorporate feedback from review
reduce detail level from user message
rework to use FetcherException instead of UncheckedIOException

* switch to debug level in order to reduce verboseness in the log
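
The commit message above mentions two ideas: an explicit connect timeout, and reporting the failure through a checked exception instead of `UncheckedIOException`. The following is a rough sketch of that pattern under stated assumptions, not the actual JabRef change: `TimeoutDownloadSketch` and `FetchFailedException` are hypothetical stand-ins for `URLDownload` and `FetcherException`, the message text is invented for illustration, and `java.time.Duration.toMillis()` is used here because the exact duration type in the commit is not shown in this thread.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;
import java.time.Duration;

/** Hypothetical stand-in for a checked fetcher exception; not JabRef's class. */
class FetchFailedException extends Exception {
    FetchFailedException(String message, Throwable cause) {
        super(message, cause);
    }
}

/** Hypothetical sketch of a download helper with an explicit connect timeout. */
final class TimeoutDownloadSketch {

    /** Downloads the URL as UTF-8 text, or throws a checked exception the UI can show. */
    static String asString(String url, Duration connectTimeout) throws FetchFailedException {
        try {
            URLConnection connection = URI.create(url).toURL().openConnection();
            // Without an explicit value the connect timeout is 0, which means "wait indefinitely".
            connection.setConnectTimeout((int) connectTimeout.toMillis());
            try (InputStream in = connection.getInputStream()) {
                return new String(in.readAllBytes(), StandardCharsets.UTF_8);
            }
        } catch (IOException e) {
            // Keep the user-facing message short; the cause carries the technical detail for the log.
            throw new FetchFailedException("Could not connect to the server at " + url, e);
        }
    }
}
```

With a checked exception, the caller is forced to handle the failure, so the dialog can tell the user that the server was unreachable instead of silently reporting zero parsed entries.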

@crystalfp
Author

I tried various entries from article references, but none of them succeeded. Maybe I'm doing something wrong, but I don't understand what.
Would it be useful if I sent you all the failed references?
For now, I will wait for the next official release before testing again.
Thanks!

@nitram509
Contributor

Would it be useful if I sent you all the failed references?

Thank you for the offer, @crystalfp.
Actually, the root cause is already known, so this would add no new information.

I guess the maintainers already have this on their "radar".

@humbleambition

Fantastic, you're working on this already then.
Any idea when this is likely to get fixed?

@github-actions
Contributor

This issue has been inactive for half a year. Since JabRef is constantly evolving, this issue may no longer be relevant, and it will be closed in two weeks if no further activity occurs.

As part of an effort to ensure that the JabRef team is focusing on important and valid issues, we would like to ask if you could update the issue if it still persists. This could be in the following form:

  • If there has been a longer discussion, add a short summary of the most important points as a new comment (if not yet existing).
  • Provide further steps or information on how to reproduce this issue.
  • Upvote the initial post if you would like to see it implemented soon. Votes are not the only metric we use to decide which requests are implemented, but they do factor into our decision-making process.
  • If all information is provided and still up-to-date, then just add a short comment that the issue is still relevant.

Thank you for your contribution!

@dbitouze

dbitouze commented Jun 13, 2023

  • JabRef 5.9--2023-01-08--76253f1a7
  • Linux 5.15.110-desktop-2.mga8 amd64
  • Java 19.0.1
  • JavaFX 19+11

This issue still happens, at least on macOS and on Windows (not on GNU/Linux).
