Add neuron-device-plugin load test to the test bed #499
base: main
Changes from 11 commits
72ce8b9
92d1d25
f85a86a
65bcba2
5557936
9c363db
c22ef1a
697fcd8
88c8b1f
0affed3
a661b0c
1c040a1
733fcd5
f99d4f0
c830538
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,65 @@ | ||
| {{$uniformQps := DefaultParam .CL2_UNIFORM_QPS 500}} | ||
| {{$neuronResourcesPerPod := DefaultParam .CL2_NEURON_RESOURCES_PER_POD 64}} | ||
| {{$neuronPods := DefaultParam .CL2_NEURON_PODS .Nodes}} | ||
| | ||
| name: neuron-workers | ||
| namespace: | ||
| number: 1 | ||
| tuningSets: | ||
| - name: UniformQPS | ||
| qpsLoad: | ||
| qps: {{$uniformQps}} | ||
| | ||
| steps: | ||
| - name: Start measurements | ||
| measurements: | ||
| - Identifier: PodStartupLatency | ||
| Method: PodStartupLatency | ||
| Params: | ||
| action: start | ||
| labelSelector: group = neuron-worker | ||
| threshold: 60s | ||
| | ||
| - name: Create pods | ||
| phases: | ||
| - namespaceRange: | ||
| min: 1 | ||
| max: 1 | ||
| replicasPerNamespace: {{$neuronPods}} | ||
| tuningSet: UniformQPS | ||
| objectBundle: | ||
| - basename: neuron-worker | ||
| objectTemplatePath: pod.yaml | ||
| templateFillMap: | ||
| Group: neuron-worker | ||
| NeuronResources: {{$neuronResourcesPerPod}} | ||
| | ||
| - name: Wait for pods to be running | ||
| measurements: | ||
| - Identifier: WaitForRunningPods | ||
| Method: WaitForRunningPods | ||
| Params: | ||
| action: gather | ||
| desiredPodCount: {{$neuronPods}} | ||
| labelSelector: group = neuron-worker | ||
| timeout: 5m | ||
| | ||
| - name: Measure pod startup latency | ||
| measurements: | ||
| - Identifier: PodStartupLatency | ||
| Method: PodStartupLatency | ||
| Params: | ||
| action: gather | ||
| | ||
| - name: Delete pods | ||
| phases: | ||
| - namespaceRange: | ||
| min: 1 | ||
| max: 1 | ||
| replicasPerNamespace: 0 | ||
| tuningSet: UniformQPS | ||
| objectBundle: | ||
| - basename: neuron-worker | ||
| objectTemplatePath: pod.yaml | ||
| templateFillMap: | ||
| Group: neuron-worker | ||
| NeuronResources: {{$neuronResourcesPerPod}} | ||
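
The three `DefaultParam` lines at the top of this config read optional `CL2_*` values and fall back to 500 QPS, 64 Neuron resources per pod, and one pod per node. A minimal sketch of an overrides file that sets them explicitly is shown here. The file name `neuron-overrides.yaml` and all values are illustrative assumptions, not part of this PR, and the sketch assumes the file is handed to ClusterLoader2 through its `--testoverrides` flag.

```yaml
# neuron-overrides.yaml (hypothetical file name, not part of this PR)
# Illustrative values only; each key replaces the matching DefaultParam fallback.
CL2_UNIFORM_QPS: 100              # rate used by the UniformQPS tuning set
CL2_NEURON_RESOURCES_PER_POD: 16  # aws.amazon.com/neuron requested by each pod
CL2_NEURON_PODS: 4                # replicas created in the single test namespace
```

Any key left out of such a file would keep the default declared in the config above.
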
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,19 @@ | ||
| --- | ||
| apiVersion: v1 | ||
| kind: Pod | ||
| metadata: | ||
| generateName: neuron-worker- | ||
| labels: | ||
| group: {{.Group}} | ||
| spec: | ||
| containers: | ||
| - name: main | ||
| image: public.ecr.aws/amazonlinux/amazonlinux:2023 | ||
| command: | ||
| - "sleep" | ||
| - "infinity" | ||
| resources: | ||
| requests: | ||
| aws.amazon.com/neuron: "{{.NeuronResources}}" | ||
| limits: | ||
| aws.amazon.com/neuron: "{{.NeuronResources}}" | ||
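
For reference, this is a sketch of what the pod template would render to with the defaults from the load test config (`Group: neuron-worker`, `NeuronResources: 64`), with the indentation that the diff table flattens restored. It assumes the `templateFillMap` values are substituted verbatim and that nothing else in the manifest is modified.

```yaml
# Rendered pod.yaml, assuming Group=neuron-worker and NeuronResources=64
apiVersion: v1
kind: Pod
metadata:
  generateName: neuron-worker-
  labels:
    group: neuron-worker
spec:
  containers:
    - name: main
      image: public.ecr.aws/amazonlinux/amazonlinux:2023
      command:
        - "sleep"
        - "infinity"
      resources:
        requests:
          aws.amazon.com/neuron: "64"
        limits:
          aws.amazon.com/neuron: "64"
```

The request and limit stay equal because Kubernetes requires that for extended resources such as `aws.amazon.com/neuron` when both are specified.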