I tried to run example.py on an A100 (80GB) GPU. It seems there is a bug at line 41 (LongLM/example.py, line 41 in commit ee92c84):

input_ids = tokenizer(prompt, return_tensors="pt").input_ids

The current implementation doesn't move the input_ids tensor onto the GPU, which causes a device-mismatch error. I fixed the issue by replacing the line above with:

input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda")
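A slightly more portable variant of this fix avoids hardcoding "cuda" and instead checks device availability first, so the script also runs on CPU-only machines. This is a minimal sketch of the idea, not code from example.py; a plain tensor stands in for the tokenizer output so the snippet is self-contained:

```python
import torch

# Pick the GPU when one is available, otherwise fall back to CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Stand-in for tokenizer(prompt, return_tensors="pt").input_ids,
# which returns a LongTensor of token ids with shape (batch, seq_len).
input_ids = torch.tensor([[1, 2, 3]])

# Move the tensor to the same device the model lives on; on a GPU box
# this is equivalent to .to("cuda") from the fix above.
input_ids = input_ids.to(device)

print(input_ids.device.type)
```

In example.py the same pattern would read `input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(model.device)`, assuming the model was already placed on the target device.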