Skip to content

multiple GITenberg repositories for same Gutenberg id #22

@rdhyee

Description

@rdhyee

One problem I'm running into -- I had assumed that repo_name in metadata.yaml is the same as that in GITenberg repo -- but not true for some repos. eg., GITenberg/Jane-Eyre_1260@bf865df --> the origin of this problem might be the TSV I had generated: https://gist.github.com/rdhyee/3c9195d639223ce5e4c7 --> there are duplicates -- e.g. https://gist.github.com/rdhyee/3c9195d639223ce5e4c7#file-gitenberg_repos_list_2-tsv-L39:

2948    1260    Jane-Eyre_1260    Jane Eyre: An Autobiography    en    7011    [1260.txt]
2947    1260    Jane-Eyre--An-Autobiography_1260    Jane Eyre: An Autobiography    en    7011    [1260.txt]

Complicated because we have both https://github.com/GITenberg/Jane-Eyre_1260 and https://github.com/GITenberg/Jane-Eyre--An-Autobiography_1260 on GITenberg.

https://www.flowdock.com/app/gluejar/gitenberg/threads/HG_IKx2gCO1Sla7Xf_ZtYeDduay

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions