Add Facilities to Chaos Daemon
In Develop a new chaos, we have added a new chaos type named HelloWorldChaos, which will print hello world in chaos-controller-manager. To actually run the chaos, we need to configure some facilities for Chaos Daemon - so that controller-manager can select the specified Pods according to the chaos configuration and sends the chaos request to the chaos-daemon corresponding to these Pods. Once these are done, the chaos-daemon could run the chaos at last.
This guide covers the following steps:
Add selector for HelloWorldChaos
In Chaos Mesh, we have defined the spec.selector field to specify the scope of the chaos by namespace, labels, annotation, etc. You can refer to Define the Scope of Chaos Experiment for more information. To specify the Pods for HelloWorld chaos:
Add the
Specfield inHelloWorldChaos:// HelloWorldChaos is the Schema for the helloworldchaos API
type HelloWorldChaos struct {
metav1.TypeMeta `json:",inline"`
metav1.ObjectMeta `json:"metadata,omitempty"`
// Spec defines the behavior of a pod chaos experiment
Spec HelloWorldSpec `json:"spec"`
}
type HelloWorldSpec struct {
Selector SelectorSpec `json:"selector"`
}
// GetSelector is a getter for Selector (for implementing SelectSpec)
func (in *HelloWorldSpec) GetSelector() SelectorSpec {
return in.Selector
}Generate boilerplate functions for the
specfield. This is required to integrate the resource in Chaos Mesh.make generate
Implement the gRPC interface
In order for chaos-daemon to accept requests from chaos-controller-manager, a new gRPC interface is required for chaos-controller-manager and chaos-daemon. Take the steps below to add the gRPC interface:
Add the RPC in chaosdaemon.proto.
service chaosDaemon {
...
rpc ExecHelloWorldChaos(ExecHelloWorldRequest) returns (google.protobuf.Empty) {}
}
message ExecHelloWorldRequest {
string container_id = 1;
}You will need to update golang code generated by this proto file:
make protoImplement the gRPC service in
chaos-daemon.Add a new file named
helloworld_server.gounder chaosdaemon, with the content as below:package chaosdaemon
import (
"context"
"fmt"
"github.com/golang/protobuf/ptypes/empty"
"github.com/chaos-mesh/chaos-mesh/pkg/bpm"
pb "github.com/chaos-mesh/chaos-mesh/pkg/chaosdaemon/pb"
)
func (s *daemonServer) ExecHelloWorldChaos(ctx context.Context, req *pb.ExecHelloWorldRequest) (*empty.Empty, error) {
log.Info("ExecHelloWorldChaos", "request", req)
pid, err := s.crClient.GetPidFromContainerID(ctx, req.ContainerId)
if err != nil {
return nil, err
}
cmd := bpm.DefaultProcessBuilder("sh", "-c", fmt.Sprintf("echo 'hello' `hostname`")).
SetNS(pid, bpm.UtsNS).
SetContext(ctx).
Build()
out, err := cmd.Output()
if err != nil {
return nil, err
}
if len(out) != 0 {
log.Info("cmd output", "output", string(out))
}
return &empty.Empty{}, nil
}After
chaos-daemonreceives theExecHelloWorldChaosrequest,chaos-daemonwill printhelloto this container's hostname.Send gRPC requests in reconcile.
When a CRD object is updated (for example: create or delete), we need to compare the state specified in the object against the actual state, and then perform operations to make the actual cluster state reflect the state specified. This process is called
reconcile.For
HelloworldChaos,chaos-controller-managerneeds to send chaos request tochaos-daemoninreconcile. To do this, we need to update the filecontrollers/helloworldchaos/types.gocreated in Develop a New Chaos with the content as below:package helloworldchaos
import (
"context"
"errors"
"fmt"
"k8s.io/apimachinery/pkg/runtime"
ctrl "sigs.k8s.io/controller-runtime"
"github.com/chaos-mesh/chaos-mesh/api/v1alpha1"
"github.com/chaos-mesh/chaos-mesh/controllers/common"
"github.com/chaos-mesh/chaos-mesh/controllers/config"
pb "github.com/chaos-mesh/chaos-mesh/pkg/chaosdaemon/pb"
"github.com/chaos-mesh/chaos-mesh/pkg/router"
ctx "github.com/chaos-mesh/chaos-mesh/pkg/router/context"
end "github.com/chaos-mesh/chaos-mesh/pkg/router/endpoint"
"github.com/chaos-mesh/chaos-mesh/pkg/selector"
"github.com/chaos-mesh/chaos-mesh/pkg/utils"
)
type endpoint struct {
ctx.Context
}
// Apply applies helloworld chaos
func (r *endpoint) Apply(ctx context.Context, req ctrl.Request, chaos v1alpha1.InnerObject) error {
r.Log.Info("Apply helloworld chaos")
helloworldchaos, ok := chaos.(*v1alpha1.HelloWorldChaos)
if !ok {
return errors.New("chaos is not helloworldchaos")
}
pods, err := selector.SelectAndFilterPods(ctx, r.Client, r.Reader, &helloworldchaos.Spec, config.ControllerCfg.ClusterScoped, config.ControllerCfg.TargetNamespace, config.ControllerCfg.AllowedNamespaces, config.ControllerCfg.IgnoredNamespaces)
if err != nil {
r.Log.Error(err, "failed to select and filter pods")
return err
}
for _, pod := range pods {
daemonClient, err := utils.NewChaosDaemonClient(ctx, r.Client, &pod, common.ControllerCfg.ChaosDaemonPort)
if err != nil {
r.Log.Error(err, "get chaos daemon client")
return err
}
defer daemonClient.Close()
if len(pod.Status.ContainerStatuses) == 0 {
return fmt.Errorf("%s %s can't get the state of container", pod.Namespace, pod.Name)
}
containerID := pod.Status.ContainerStatuses[0].ContainerID
_, err = daemonClient.ExecHelloWorldChaos(ctx, &pb.ExecHelloWorldRequest{
ContainerId: containerID,
})
if err != nil {
return err
}
}
return nil
}
// Recover means the reconciler recovers the chaos action
func (r *endpoint) Recover(ctx context.Context, req ctrl.Request, chaos v1alpha1.InnerObject) error {
return nil
}
// Object would return the instance of chaos
func (r *endpoint) Object() v1alpha1.InnerObject {
return &v1alpha1.HelloWorldChaos{}
}
func init() {
router.Register("helloworldchaos", &v1alpha1.HelloWorldChaos{}, func(obj runtime.Object) bool {
return true
}, func(ctx ctx.Context) end.Endpoint {
return &endpoint{
Context: ctx,
}
})
}Notes:
In our case here, the
Recoverfunction does nothing becauseHelloWorldChaosonly prints some log and doesn't change anything. You may need to implement theRecoverfunction in your development.
Verify your chaos
Now you are all set. It's time to verify the chaos type you just created. Take the steps below:
Make the Docker image. Refer to Make the Docker image.
Upgrade Chaos Mesh. Since we have already installed Chaos Mesh in Develop a New Chaos, we only need to restart it with the latest image:
kubectl rollout restart deployment chaos-controller-manager -n chaos-testing
kubectl rollout restart daemonset chaos-daemon -n chaos-testingDeploy the Pods for test:
kubectl apply -f https://raw.githubusercontent.com/chaos-mesh/apps/master/ping/busybox-statefulset.yamlThis command deploys two Pods in the
busyboxnamespace.Create the chaos YAML file:
apiVersion: chaos-mesh.org/v1alpha1
kind: HelloWorldChaos
metadata:
name: busybox-helloworld-chaos
spec:
selector:
namespaces:
- busyboxApply the chaos:
kubectl apply -f /path/to/helloworld.yamlVerify your chaos. There are different logs to check to see whether your chaos works as expected:
Check the log of
chaos-controller-manager:kubectl logs chaos-controller-manager-{pod-post-fix} -n chaos-testingThe log is as follows:
2020-09-09T09:13:36.018Z INFO controllers.HelloWorldChaos Reconciling helloworld chaos {"reconciler": "helloworldchaos"}
2020-09-09T09:13:36.018Z INFO controllers.HelloWorldChaos Apply helloworld chaos {"reconciler": "helloworldchaos"}Check the log of
chaos-daemon:kubectl logs chaos-daemon-{pod-post-fix} -n chaos-testingThe log is as follows:
2020-09-09T09:13:36.036Z INFO chaos-daemon-server exec hello world chaos {"request": "container_id:\"docker://8f2918ee05ed587f7074a923cede3bbe5886277faca95d989e513f7b7e831da5\" "}
2020-09-09T09:13:36.044Z INFO chaos-daemon-server build command {"command": "nsenter -u/proc/45664/ns/uts -- sh -c echo 'hello' `hostname`"}
2020-09-09T09:13:36.058Z INFO chaos-daemon-server cmd output {"output": "hello busybox-1\n"}
2020-09-09T09:13:36.064Z INFO chaos-daemon-server exec hello world chaos {"request": "container_id:\"docker://53e982ba5593fa87648edba665ba0f7da3f58df67f8b70a1354ca00447c00524\" "}
2020-09-09T09:13:36.066Z INFO chaos-daemon-server build command {"command": "nsenter -u/proc/45620/ns/uts -- sh -c echo 'hello' `hostname`"}
2020-09-09T09:13:36.070Z INFO chaos-daemon-server cmd output {"output": "hello busybox-0\n"}We can see the
chaos-daemonprintshelloto these two Pods.