Skip to content

Observability support  #64

@JamesWrigley

Description

@JamesWrigley

Things that we could do:

  • Keep counters that track communication statistics between workers, like the number of RPC calls and the amount of data sent/received.
  • Expose a hook API so packages can register functions that will be called on sending/receiving data. Not sure about the performance implications of this.
  • Read-only API to check the status of a worker, the time of last contact, and state of worker-to-worker connections.

Use cases:

  • Tracking network usage between workers on HPC clusters
  • Better support for profiling tools like Extrae (Roadmap #1 (comment))

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions