no transcript_ids produced in the gtf and the transcript tables

Dear IsoTools Team,

I have noticed that the tool does not appear to output specific transcript IDs when a single gene has multiple known isoforms. Because the resulting expression table only displays values aggregated at the gene level, it is difficult to determine which specific isoform is being expressed.

Initially, I suspected this might be an issue unique to my dataset or potentially caused by a misconfiguration in my annotation GTF file. However, after testing the provided demo_data using the standard import workflow below, I encountered the exact same behavior:

```python
# integrate the samples
for i,row in samples.iterrows():
    # this step takes about 5-30 seconds per sample
     isoseq.add_sample_from_bam(row.file_name, sample_name=row.sample_name, group=row.group)
# the sample table of the transcriptome object contains the number of imported reads
isoseq.sample_table
``` 

Given this layout, what is the recommended way to retrieve isoform/transcript-level quantification? Furthermore, would a fix for this require adjusting the pipeline downstream, or should transcript-level assignments be handled prior to the alignment step?

Thank you for your time and help!

Best regards,

Amandeep

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

no transcript_ids produced in the gtf and the transcript tables #32

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

no transcript_ids produced in the gtf and the transcript tables #32

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions