Hey! Guys I need someone to write for me a simple script to detect CPU failure on UNIX AIX. It doesnot have to be complicated.
Here are some ideas
create a script and the output of the script would give us CPU count of 0, 1, 2, 3, 4 . I want Patrol to monitor to be able to monitor that script and the log file would have the output of the script.
The logic behind that is, if the output is for example 0 or 1 or 2,3,4 , Then everything is OK, there are no problem.
And if the CPU doesn't not meet that string it would generate something called AUTOALARM and log that with a message such as your system has dropped one CPU or to CPU.
After logging it, it would write to the script log; And then it would create a ticket For the ESM team. Anyhow, any script you can write to datect CPU failure will be appreciated.
I would suggest using commands such as lsdev or lsconf or vmstat
lsdev -C | grep proc
proc0 Available 00-00 Processor
proc1 Available 00-01 Processor
proc2 Available 00-02 Processor
proc3 Available 00-03 Processor