Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems in chm13v2.0_RefSeq_Liftoff_v5.1.gff3 #95

Open
yeeus opened this issue Apr 20, 2024 · 1 comment
Open

Problems in chm13v2.0_RefSeq_Liftoff_v5.1.gff3 #95

yeeus opened this issue Apr 20, 2024 · 1 comment

Comments

@yeeus
Copy link

yeeus commented Apr 20, 2024

I have used liftoff using the annotation file of GRCh38.p14 and liftoff was running normally without any problems. But when I used chm13v2.0_RefSeq_Liftoff_v5.1.gff3 (after some tests, I found it was caused by chromosome Y), I got an error (the gff_file was chrY.gff):

>>> feature_db = gffutils.create_db(gff_file, gff_file + "_db", merge_strategy="create_unique", force=True, disable_infer_transcripts=True, disable_infer_genes=True, verbose=True)
2024-04-20 15:22:46,577 - INFO - Populating features
Traceback (most recent call last):t-order relations: 13000 features
  File "~/mambaforge/envs/liftoff/lib/python3.10/site-packages/gffutils/create.py", line 622, in _populate_from_lines
    self._insert(f, c)
  File "~/mambaforge/envs/liftoff/lib/python3.10/site-packages/gffutils/create.py", line 566, in _insert
    cursor.execute(constants._INSERT, feature.astuple())
sqlite3.IntegrityError: UNIQUE constraint failed: features.id

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "~/mambaforge/envs/liftoff/lib/python3.10/site-packages/gffutils/create.py", line 1401, in create_db
    c.create()
  File "~/mambaforge/envs/liftoff/lib/python3.10/site-packages/gffutils/create.py", line 543, in create
    self._populate_from_lines(self.iterator)
  File "~/mambaforge/envs/liftoff/lib/python3.10/site-packages/gffutils/create.py", line 656, in _populate_from_lines
    self._insert(f, c)
  File "~/mambaforge/envs/liftoff/lib/python3.10/site-packages/gffutils/create.py", line 566, in _insert
    cursor.execute(constants._INSERT, feature.astuple())
sqlite3.IntegrityError: UNIQUE constraint failed: features.id

Do you have some ideas?

@arangrhie
Copy link
Collaborator

Hi @yeeus, there were a couple TSPY genes that we had to manually curate, which might have caused problems. I don't have a good solution for this, perhaps you could manually adjust the feature id?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants