T-rex pkill issue during initialization

      Critical: brings down perf testbeds. Before a new T-rex instance is started, properly detect whether T-rex is still running, using a maximum number of pkill retries with a timeout between tries. If a Jenkins job is forced to stop, T-rex remains running under a different parent process, and once a new instance is initialized, T-rex puts itself into D state (uninterruptible sleep) with no option of recovery. After TBx goes down due to this problem, in the best case we can log in to the tg-host and troubleshoot the TRex process, but we are not able to kill it, and the only solution is to reboot the tg-host. In the worst case the kernel is messed up, we are not able to log in at all, and we need to reboot the tg-host.
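
      The requested check could look roughly like the sketch below. It is not the CSIT implementation, only an illustration of "pkill with a maximum number of retries and a timeout between tries". The process pattern "t-rex", the function names, and the default retry count and delay are all assumptions made for this example.

```python
import subprocess
import time


def is_running(pattern):
    """Return True if pgrep finds at least one matching process."""
    # pgrep exits with status 0 when something matches, 1 when nothing does.
    result = subprocess.run(
        ["pgrep", pattern],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def send_kill(pattern):
    """Send SIGKILL to every process matching the pattern."""
    subprocess.run(
        ["pkill", "-KILL", pattern],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )


def ensure_stopped(pattern, retries=5, delay=2.0,
                   running=is_running, kill=send_kill, sleep=time.sleep):
    """Try up to `retries` times to stop matching processes.

    Returns True once no matching process is left, False if one
    survives every attempt (for example because it is stuck in D state).
    The `running`, `kill` and `sleep` hooks are hypothetical and exist
    only so the loop can be exercised without touching real processes.
    """
    for _ in range(retries):
        if not running(pattern):
            return True
        kill(pattern)
        sleep(delay)  # the timeout between tries
    # Final check after the last kill attempt.
    return not running(pattern)
```

      A caller would start a new instance only when ensure_stopped("t-rex") returns True. A False result means the old process could not be removed, so the job should fail early instead of initializing a second instance on top of it.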

            pmikus Peter Mikus

            Original Estimate: 3 days
            Remaining Estimate: 3 days
            Time Spent: Not Specified