Page Comparison

...

Page Properties

Target release

Mercury

Epic

Document status

Status


colour	GreenYellow
title	completein progress

Document owner

Medha Parlikar (Unlicensed)

Designer

Developers

QA

...

Node operators will be responsible for setting up a node monitoring system.
Node operators will be able to use available RCHain documentation to interface with the node metrics API to pull data needed for their monitoring system.

Requirements

#

Title

User Story

Importance

Notes

Status

1

Node emits metrics through the metrics API

Node operators want access to any and all metrics related to node operation and performance.

Must have

The node emits metrics

Expose

estensive

extensive metrics on the system and JVM At minimum the following metrics should be exposed: CPU RAM Disk Network core metrics at the core level JVM performance Garbage collection Size of memory pools Consumption of memory pools Monitoring systems can pull metrics from the node The node does not push metrics The node does not report on itself	Metrics emitted on http port 40403. Complete (metrics emitted via API)
2	Node measures COMM Events	COMM events are a measure of an RChain transaction. The node must report on how many raw comm events are being processed, so we can demonstrate 40K COMM events /second	Must Have	Create a counter of COMM events as a total count of events in the last hour. Rita Allen to confirm with SRE if the metrics should reset in the past hour. COMM events should be measured for Propose (block creation) and Replay RSpace.	Rita Allen to validate the requirement, see what has been implemented.
3	Node reports on CPU Utilization	The Node reports on the percentage of CPU that is being consumed.	Must Have	Report on the percentage of CPU being consumed at the time the request for metrics is being made.	Rita Allen Please confirm the status
4	Node reports on total RAM consumed	The node reports on the amount of RAM being consumed	Must Have	Report on the total amount of RAM being consumed at the time the request for metrics is being made	Rita Allen Please confirm the status
5	Deploy Count	The node reports on the total number of deployments received	Must Have	Report on the total number of times the deploy API receives a request.	Needs to be implemented. Should have a ticket. Add ticket number here Rita Allen
6	Blocks proposed	A validating node proposes blocks	Must Have	Total blocks proposed by the node in the past hour. Metric should reset in the past hour. Rita Allen to confirm.	Needs to be implemented. Should have a ticket. Add ticket number here Rita Allen.
7	Fork Choice Tip	For something like Ethstats.net, it would be good to show the current fork choice tip for each node	Nice to have	Show the hash of the block that is the current fork choice rule.
8	Blocks being processed?
9	Time since last block?
10	Demonstration of a node operating monitoring system	Node operators will have various needs and expectations for their node monitoring system. The RChain node will not dictate the system they use. However, at launch of test net there will be an example of a node operating system to share as an example, along with documentation in the event a node operator wants to reproduce the example monitoring system.	Must have	Create a Pyrofex node monitoring system Document the system so others can create something similar. This system will be created using Prometheus and

Graphana

Grafana
Needed at time of launch of test net

3


11	Documentation on how to export metrics using Prometheus	Node operators need to know how to access the Prometheus integration in the node.	Must have

4

12

Documentation on how to export metrics from metrics API using Scala

Node operators need to know how to interface with the metrics API.

Developers need to know how to interface with the metrics API when integrating features with the node.

Must have

Pawel to help create a template
- Jeremy to use template to create export for required metrics listed above

5


13	Metrics acceptability testing	Validate exposure and pull of metrics works using both Prometheus and the metrics API	Must have	Acceptable when: Node comes up Node emits metrics Metrics are scrapable through metric API Metrics are scrapable with Prometheus Metrics pulled through metric API and Prometheus match

6


14	Integration with Docker compose	Some node operators may want the option to monitor the node using Docker compose	Optional	Integrate node with Docker compose Maintain the integration over time Document the integration - this is already available.

Traceability matrix

Jira Legacy

server	System JIRA
columns	key,summary,type,assignee,reporter,priority,status,resolution,fixversions,sprint
maximumIssues	1000
jqlQuery	project = CORE AND "Epic Link" = CORE-195 ORDER BY created DESC
serverId	50130123-f232-3df4-bccb-c16e7d83cd3e

...

Versions Compared

Old Version 10

New Version 11

Key

Requirements

Traceability matrix