00:04
Hello, I'm Dean Pompilio, and welcome to less than five for module 10 in the Virtual Ization Configuration
00:11
Installation Management course. This lesson will be talking about alarms and how useful they are for your managing your environment. We'll look at what's involved with creating a condition based trigger for your alarm or an event based trigger.
00:25
We'll see how easy it is to view your alarms and acknowledge them
00:29
and wrap things up by doing lab number 18.
00:35
So an alarm is a way to get notified when an event or condition exists, and you have a lot of flexibility in how you create
00:43
the triggers that will allow the alarm to happen.
00:48
There's a lot of default alarms that get created when you install the center for Hosts of'em plenty of via default alarms to choose from there.
00:57
And then we can to make custom alarms for all these different areas in your host. Your cluster data centers, distributed switches,
01:06
data storage or networks,
01:08
and it's kind of easy to get carried away, actually with all the possibilities. But a
01:14
ah mature virtual environment should have a reasonable set of alarms configured
01:19
so that the administrator can get notified automatically when certain problems crop up or when certain problems
01:25
are starting to get worse.
01:27
We have the ability to
01:30
go from a normal range of operation to a warning level and then from the warning level to the alert level. And then you can configure alarms to go from alert back toe warning from warning back to normal so you can see a problem as it gets worse. And as it gets better and how you decide to set up the alarms would be up to you,
01:49
for example, some of the alarms you might.
01:53
She has previously mentioned default alarms
01:56
errors with your host. Maybe the CPU usage on the host is too high.
02:00
VM has been powered on or avian has been powered off. Maybe there's a problem with a license
02:05
or you're running out of space in one of your data stores.
02:08
These are all things you want to know about,
02:10
as the problem is going from normal to a warning level so you can set thresholds to decide when you want to know about it. To give you enough time to fix the issue before becomes critical,
02:23
fair, simple to create alarm all you do is selected item in the inventory, right click and say at alarm.
02:30
Once an alarm has been triggered, you can also right Click it when you're looking at the alarms, tab, alarms and events, and you can announce the alarm,
02:38
meaning that you're showing that you have at least looked at it.
02:43
But that doesn't clear the alarm. You have toe right click again and say clear if you want to. Clearly love. Sometimes you want to acknowledge an alarm,
02:50
but keep it in the list of triggered alarms because you want to see what happened and went.
02:55
So that case acknowledging just
02:58
basically removes the
03:00
the visual display up that the alarm has been triggered because you'll get a little red, diamond or yellow triangle on your inventory. Object when alarm has been triggered.
03:10
You can also copy the text of the alarm to your clipboard,
03:14
which might be useful in case you're trying to troubleshoot an error message, and you want to be able to pace that information to a search engine or or other
03:27
So our alarm settings
03:30
are involving either specific conditions or state
03:34
of some. Some object in your inventory or specific events that are occurring for that object.
03:42
When you look at the ad alarm dialogue box, you'll notice that it also has an able enable and disabled check box.
03:50
This is very useful as well, because you can create your alarms.
03:53
Leave them all disabled until you've got them all created, and then turn them on and see what you know if they're working the way you wish.
04:00
If an alarm turns out to be generating a lot of alerts, but it's not really doing what you want, you can always go in and disable it
04:08
without having to delete it and then re create it later.
04:11
So this gives you the ability to deal with your alarms in the fashion and a workflow that works best for you.
04:17
So when we're setting up a condition trigger, we have to think about
04:23
the warning that we're actually trying to.
04:28
some of these examples here u V m a memory usage, host, usage of CPU or memory,
04:35
and then we have to think about the length of that condition,
04:40
and once the once the length of the condition has been met, then we can generate a warning or an alert.
04:46
Same thing with the alert way. Figure out what we want to alert on.
04:50
It could be the same as the warning condition. Usually that's the case.
04:55
And then you set the condition length for the warning here.
05:00
we can say that if if we CPU usage is above 75%
05:05
you have the above A CZ below is equal to
05:10
so you can you can set some different parameters here. In this case, we want to know the CPU usage is above 75% for five minutes.
05:17
That's a reasonable alarm to set for a warning. That means you're VM is getting busy,
05:23
but it hasn't reached a critical stage yet.
05:25
So in this case, I might want to know about that so I could set up
05:29
the alarm. Thio, notify me in various different ways and we'll cover that shortly.
05:34
However, I can also sudden alert
05:36
Now if CP usage is above 90% for five minutes, I definitely want to set an alert. That means that my virtual machine is running out of resources and I may need to take some kind of action
05:48
in order to prevent the problem for becoming worse.
05:56
So the triggers that we define for for these two conditions, whether it's a warning or alert, very configurable
06:03
if I'm setting up conditions. However, with events now I'm using arguments, operators and values.
06:12
I can say that, for instance, I could make an alarm that says, When the guests the West is shut down on this particular object sending alert.
06:19
In this case, we're just looking for the event. We don't care how long the condition lasts. If the event happens, we want to know about it.
06:27
So that changes the kinds of situations where you want to get an event based trigger used versus a condition based trigger
06:34
condition. Triggers are more about managing your environment,
06:38
you know, on a day to day basis, hour to hour,
06:42
whereas of Empress triggers are looking for some some event that happens. That may be a rare thing.
06:47
Like like maybe one of your hosts get shut down.
06:50
That shouldn't be happening on a regular basis. So maybe we set up a event for that,
06:57
or maybe a host or one of your V EMS powers down, and we want to know when that happens
07:02
in the case of power down. Setting a length of the condition wouldn't make any sense. It's just a event
07:10
once, Once you trigger
07:12
based on a condition or event, you have to think about how you want to
07:15
change the reporting parameters.
07:18
So what we can do is
07:20
a judge. Adjust the range of frequency of the reporting to avoid getting repeat alerts and to also deal with small fluctuations. For instance, if you specify the range,
07:32
you can specify a percentage above or below the limit that you set.
07:39
going back to this example if I had a CPU usage above 75% for five minutes,
07:47
that this alert will trigger after five minutes after another five minutes. If it's still above 75% I'll get another alert. Maybe I don't want to get too many of those,
07:58
ah percentage above or below this in order to limit
08:03
I rather to deal with small fluctuations. If it's you know, 77%
08:11
you know, maybe I don't want to know about. So
08:13
if I said 75% for five minutes, I could say on Lee. Let me know if this goes up by 10%
08:22
So if I added seven and 1/2 percent, I'd get 82 a half percent.
08:26
So now on. Lee sent me another alert. If I go
08:30
10% above what my limit is currently set or 10% below
08:33
just depends on how you want to configure that.
08:37
Then we have the frequency adjustment.
08:39
This means that you can say, even though you're looking for a five minute window for the condition to exist, I can also
08:46
limit how many times I get notified within a certain time. Rage,
08:50
maybe only notify me once within a 10 minute ranger once within a 15 minute range.
08:56
That way, especially if you're getting alerts in the middle of the night, you're you're limiting the amount of disruption.
09:03
or just in the middle of the day, even you're limiting the amount of disruption that it causes. You get the first alert, you decide to look into it. You know, after a certain amount of time, another alert look, it will happen. If the condition still exists,
09:16
then the reporting options are done. So then you think about what actions you'd like to take.
09:20
When the alarm gets triggered, you can send an email. This is a great option. Since everybody's checking email, you might get automatic notification on your mobile device that in e mails come in
09:31
can also send an S and M p trap.
09:35
If so, if you're using S and M P to manager your devices on your network, this is also useful
09:41
or you can run a command. The command could be a script. It could be
09:45
anything you wish. You just have to specify it. Have a way to reach the absolute path to that command from your configuration dialogue box.
09:56
We also have to think about the different states.
10:00
So as I was mentioning earlier, when we go from a normal status to a warning status, we could trigger an alarm there.
10:07
When we go from warning to alert, we can also trigger an alarm
10:11
and reverse is also true. Going from alert back down the warning or from warning back down to normal. We can also configure alarms for these,
10:18
and this gives you the flexibility to say that
10:22
if I'm going from normal to warning, give me an alert. Give it some time on Lee. If it exceeds some parameters that I define as being this condition appears to be lasting longer than I would expect. Then send me the the next alarm when we go to the alert level
10:39
and then on the way back down, you could also do that.
10:43
For instance, this might be useful. Let's say you're
10:46
using this scenario 75% for five minutes for a warning or 90% for an alert.
10:52
It might be that your
10:54
of your VM gets busy for five minutes at 75% then it goes up to 90%. And somewhere in that 90%
11:01
are somewhere in that next five minutes it goes back down below 90%.
11:07
So I'll get an alert toe warning message telling me that that the condition is starting to get better on its own.
11:13
I can get another warning
11:15
or another alert, rather from the warning to the normal level telling me that I'm going from
11:20
the 75% back down to something below 75%.
11:24
So if you were stuck in a meeting or you are unable to take action, you could at least get notified when the problems is getting worse. And then when it's getting better. Maybe there was some temporary processing that caused the problem, and it's been completed, so now you're VM is back to normal.
11:37
Lastly, we have the notification configuration. Here you can select email
11:45
or SMTP or your command window,
11:48
and you just specify the server info. If you have an SMTP server configured in your environment, you put inside the address. You can pick the port number or you specify an email account that you'd like to use for sending messages to.
12:03
So after you looked at all this as a recap, we know what with the alarms are what kinds of default alarms air set up where you can make custom alarms.
12:11
Some examples of those
12:13
right clicking an alarm to add it to an inventory object, right clicking to acknowledge or clear.
12:20
And then we talked about the different conditions or events that would allow you to trigger an alarm
12:24
and the ability to enable I'm disabled.
12:28
Then you need to know the warning and condition length from going from normal T warning from warning to alert from alert back to warning and from warning back to normal so we can have a lot of flexibility for how we set this up.
12:41
And then we saw some of the ways you can get notified when your conditions are met,
12:45
and that helps you as an administrator, deal with your environment without having to constantly manually check these things.
12:50
Okay, so lab number 18 is your next task. In this lab,
12:54
you'll be setting up alarms that look for a condition, setting up alarms. Look for an event.
13:00
You'll trigger the alarm, using some of the tools that we use an earlier labs to generate CPU traffic.
13:05
And then you'll disable the alarms. And that concludes lesson number five. Thank you.