v0.7

@J535D165 released this 10 Sep 06:56
· 44 commits to main since this release
e0f44da

What's Changed

  • Enhance errors for repositories that throw 403 errors by @J535D165 in #50
  • Add support for DOIs pointing to single files by @J535D165 in #51

Full Changelog: v0.6...v0.7

Coverage report

The following benchmark was run on 500 randomly selected records from DataCite.

Percentages

Percentage of datasets supported: 18.6%
Percentage of datasets not supported: 75.0%
Percentage of datasets with error: 6.4%
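
To relate these percentages to the sample of 500 records, the following minimal sketch converts them back into record counts. The counts are derived here from the reported figures, not taken from the benchmark's raw output, and the names `SAMPLE_SIZE`, `percentages`, and `to_counts` are illustrative assumptions.

```python
# Sketch: convert the reported percentages into record counts.
# SAMPLE_SIZE and the percentages come from the report above;
# the helper name to_counts is hypothetical.
SAMPLE_SIZE = 500

percentages = {
    "supported": 18.6,
    "not supported": 75.0,
    "error": 6.4,
}


def to_counts(percentages, sample_size):
    """Return the number of records behind each percentage."""
    # round() absorbs floating-point noise such as 92.99999999999999
    return {
        label: round(pct / 100 * sample_size)
        for label, pct in percentages.items()
    }


counts = to_counts(percentages, SAMPLE_SIZE)
for label, n in counts.items():
    print(f"{label}: {n} of {SAMPLE_SIZE}")
```

The error count obtained this way matches the number of rows in the table of unexpected errors below.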

Table with unexpected errors

index id type url service error
9 10.48448/kgfs-s492 dois https://underline.io/lecture/50210-findings-thai-nested-named-entity-recognition-corpus nan 500 Server Error: Internal Server Error for url: https://underline.io/lecture/50210-findings-thai-nested-named-entity-recognition-corpus
64 10.7910/dvn/ghcv1g/bbucjs dois https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/GHCV1G/BBUCJS nan Failed to parse URL 'https://dataverse.harvard.edu/loginpage.xhtml;jsessionid=de4d68eca12a3479d7a636cd6d83?redirectPage=%2Ffile.xhtml%3FpersistentId%3Ddoi%3A10.7910%2FDVN%2FGHCV1G%2FBBUCJS'
73 10.20345/digitue.1029.61 dois http://idb.ub.uni-tuebingen.de/opendigi/litrdsch_1902#p=141 nan 500 Server Error: Internal Server Error for url: https://idb.ub.uni-tuebingen.de/opendigi/litrdsch_1902#p=141
81 10.7916/d8-qcx3-yp94 dois https://dlc.library.columbia.edu/resolve/10.7916/d8-qcx3-yp94 nan 500 Server Error: Internal Server Error for url: https://dlc.library.columbia.edu/catalog/10.7916/d8-qcx3-yp94
96 10.17876/plate/dr.2/plates/201_33742 dois https://www.plate-archive.org/objects/dr.2/plates/201_33742 nan 500 Server Error: Internal Server Error for url: https://www.plate-archive.org/objects/dr.2/plates/201_33742/
128 10.25560/78890 dois http://spiral.imperial.ac.uk/handle/10044/1/78890 nan 404 Client Error: for url: https://spiral.imperial.ac.uk/rest/handle/10044/1
146 10.15496/publikation-32226 dois https://publikationen.uni-tuebingen.de/xmlui/handle/10900/90845 nan 403 Client Error: Forbidden for url: https://publikationen.uni-tuebingen.de/rest/handle/10900/90845
163 10.34755/irok.2022.72.26.033 dois https://www.elibrary.ru/item.asp?id=48800309&pff=1 nan ('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))
170 10.25673/opendata2-168870 dois https://opendata2.uni-halle.de//handle/1516514412012/168894 nan 404 Client Error: for url: https://opendata2.uni-halle.de/rest/handle/1516514412012/168894
200 10.23725/akhp-6959 dois https://ors.datacite.org/doi:/10.23725/akhp-6959 nan HTTPSConnectionPool(host='ors.datacite.org', port=443): Max retries exceeded with url: /doi:/10.23725/akhp-6959 (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f1549b33710>: Failed to resolve 'ors.datacite.org' ([Errno -2] Name or service not known)"))
202 10.48370/ofd/clo2tl/a0ndzw dois https://dataverse.openforestdata.pl/file.xhtml?persistentId=doi:10.48370/OFD/CLO2TL/A0NDZW nan list index out of range
252 10.14469/ch/129258 dois https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/134211 nan HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/134211 (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_HANDSHAKE_FAILURE] sslv3 alert handshake failure (_ssl.c:1006)')))
257 10.25673/opendata2-91140 dois https://opendata2.uni-halle.de//handle/1516514412012/91154 nan 404 Client Error: for url: https://opendata2.uni-halle.de/rest/handle/1516514412012/91154
258 10.14469/ch/41814 dois https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/48213 nan HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/48213 (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_HANDSHAKE_FAILURE] sslv3 alert handshake failure (_ssl.c:1006)')))
296 10.14457/cmu.the.2009.132 dois http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14457/CMU.the.2009.132 nan HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Read timed out. (read timeout=3)
297 10.6085/aa/lop001_026mtbd004r00_20100911.40.1 dois https://data.piscoweb.org/catalog/d1/mn/v1/object/doi:10.6085/AA/LOP001_026MTBD004R00_20100911.40.1 nan Failed to parse URL 'https://data.piscoweb.org/metacat/d1/mn/v1/object/doi:10.6085/AA/LOP001_026MTBD004R00_20100911.40.1'
316 10.14469/ch/90617 dois https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/97675 nan HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/97675 (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_HANDSHAKE_FAILURE] sslv3 alert handshake failure (_ssl.c:1006)')))
321 10.14456/apsr.2022.3 dois http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14456/apsr.2022.3 nan HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Read timed out. (read timeout=3)
324 10.25646/886 dois https://edoc.rki.de/handle/176904/961 nan 403 Client Error: Forbidden for url: https://edoc.rki.de/rest/handle/176904/961
337 10.18419/darus-3307/17 dois https://darus.uni-stuttgart.de/file.xhtml?persistentId=doi:10.18419/darus-3307/17 nan list index out of range
394 10.5287/bodleianjpcy.2 dois https://databank.ora.ox.ac.uk/ww1archives/datasets/ww1-3945?version=2 nan HTTPSConnectionPool(host='databank.ora.ox.ac.uk', port=443): Max retries exceeded with url: /ww1archives/datasets/ww1-3945?version=2 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f15496741d0>, 'Connection to databank.ora.ox.ac.uk timed out. (connect timeout=3)'))
408 10.6085/aa/sh15xx_015ctdx015r00_20150622.42.1 dois https://data.piscoweb.org/catalog/d1/mn/v1/object/doi:10.6085/AA/SH15XX_015CTDX015R00_20150622.42.1 nan Failed to parse URL 'https://data.piscoweb.org/metacat/d1/mn/v1/object/doi:10.6085/AA/SH15XX_015CTDX015R00_20150622.42.1'
414 10.17170/kobra-202007241490 dois https://kobra.uni-kassel.de/handle/123456789/11715 nan 404 Client Error: for url: https://kobra.uni-kassel.de/rest/handle/123456789/11715
420 10.6085/aa/shb001_021mxti012r00_19991203.40.1 dois https://data.piscoweb.org/catalog/d1/mn/v1/object/doi:10.6085/AA/SHB001_021MXTI012R00_19991203.40.1 nan Failed to parse URL 'https://data.piscoweb.org/metacat/d1/mn/v1/object/doi:10.6085/AA/SHB001_021MXTI012R00_19991203.40.1'
421 10.57967/hf/0034 dois https://huggingface.co/datasets/rdp-studio/paimon-voice nan 'HuggingFaceDataset' object has no attribute 'API_URL_META'
433 10.6085/aa/srsxxx_015mtbd009r00_20030606.40.2 dois https://data.piscoweb.org/catalog/d1/mn/v1/object/doi:10.6085/AA/SRSXXX_015MTBD009R00_20030606.40.2 nan Failed to parse URL 'https://data.piscoweb.org/metacat/d1/mn/v1/object/doi:10.6085/AA/SRSXXX_015MTBD009R00_20030606.40.2'
434 10.25316/ir-3643 dois https://viurrspace.ca/handle/10613/9170 nan 403 Client Error: Forbidden for url: https://viurrspace.ca/rest/handle/10613/9170
470 10.25316/ir-13646 dois https://viurrspace.ca/handle/10613/21545 nan 403 Client Error: Forbidden for url: https://viurrspace.ca/rest/handle/10613/21545
480 10.21256/zhaw-1700 dois https://digitalcollection.zhaw.ch/handle/11475/3157 nan 404 Client Error: 404 for url: https://digitalcollection.zhaw.ch/rest/handle/11475/3157
481 10.34628/w74t-gn74 dois http://hdl.handle.net/11067/89 nan HTTPSConnectionPool(host='repositorio.ulusiada.pt', port=443): Max retries exceeded with url: /rest/handle/11067/89 (Caused by SSLError(SSLEOFError(8, '[SSL: UNEXPECTED_EOF_WHILE_READING] EOF occurred in violation of protocol (_ssl.c:1006)')))
487 10.6084/m9.figshare.c.3618695 dois https://figshare.com/collections/Transcriptome_analyses_of_seed_development_in_grape_hybrids_reveals_a_possible_mechanism_influencing_seed_size/3618695 nan Failed to parse URL 'https://figshare.com/collections/Transcriptome_analyses_of_seed_development_in_grape_hybrids_reveals_a_possible_mechanism_influencing_seed_size/3618695'
493 10.7916/d8-47rs-s759 dois https://dlc.library.columbia.edu/resolve/10.7916/d8-47rs-s759 nan 500 Server Error: Internal Server Error for url: https://dlc.library.columbia.edu/catalog/10.7916/d8-47rs-s759
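
Many of the errors above fall into a few recurring kinds (HTTP status errors, SSL failures, timeouts, URL parse failures). As a rough sketch of how the error column could be grouped, the snippet below classifies a handful of messages copied from the table. The function `classify_error` and its category labels are assumptions made for illustration and are not part of the released tool.

```python
import re
from collections import Counter


def classify_error(message: str) -> str:
    """Map a raw error message to a coarse category (hypothetical labels)."""
    status = re.match(r"(\d{3}) (?:Client|Server) Error", message)
    if status:
        return f"HTTP {status.group(1)}"
    if "SSLError" in message:
        return "SSL error"
    if "NameResolutionError" in message:
        return "DNS failure"
    if "timed out" in message:
        return "timeout"
    if message.startswith("Failed to parse URL"):
        return "URL parse failure"
    if "Connection aborted" in message:
        return "connection reset"
    return "other"


# A small sample of messages taken verbatim from the table above.
sample_errors = [
    "500 Server Error: Internal Server Error for url: "
    "https://dlc.library.columbia.edu/catalog/10.7916/d8-47rs-s759",
    "403 Client Error: Forbidden for url: "
    "https://edoc.rki.de/rest/handle/176904/961",
    "HTTPSConnectionPool(host='doi.nrct.go.th', port=443): "
    "Read timed out. (read timeout=3)",
    "list index out of range",
]

summary = Counter(classify_error(msg) for msg in sample_errors)
for category, n in summary.most_common():
    print(f"{category}: {n}")
```

Grouping by category in this way makes it easier to tell failures on the remote side (5xx responses, SSL handshake errors, timeouts) apart from cases that may point to a gap in the tool itself, such as `list index out of range`.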