Time
1 hour 43 minutes
Difficulty
Intermediate
CEU/CPE
2

Video Transcription

00:00
All right, everyone, let's begin learning about the best practices of network maintenance.
00:07
How can we maintain our network in the most efficient and effective way possible?
00:12
We'll be looking at handling network issues,
00:15
managing changes in the network
00:17
and determining possible improvements to network design and operations to prevent or reduce the risk of network interruptions or issues.
00:31
So what would be the best way to handle a network issue? If something happened, what would you do
00:36
first? Good practice when handling a network issue is to keep calm and not panic. Yes, even if it is imperative to restore the network or service as immediately as possible.
00:47
Remember your procedures. Understand the case or issue being encountered. Identify priority levels. Priority levels may vary among industries and on a case-by-case basis.
00:59
Do your best to resolve the issue within your expertise and based on the training you received. If you feel the incident is more than you can handle,
01:07
escalate the matter to a specialized team or a higher tier.
01:11
Your aim is to restore the network to the last stable working condition, which is why you must always have a backup configuration set ready.
01:19
What if there is a sudden interruption of the network? The changes were made. Everything seemed fine, but something still went wrong.
01:26
Immediately escalate to a specialized team or involve your vendors if necessary.
01:34
What if, after all the troubleshooting, the issue still hasn't been resolved?
01:41
Always think of a Plan B, which is deciding on an interim or permanent solution. But first,
01:47
understand the problem and identify the cause of the issue.
01:51
Use existing equipment or components in your inventory, for example, using spares or executing redundancy manually
01:59
and/or modifying the network design if necessary.
02:02
An interim solution is usually unstable, but it gets the network up and running to give you a window of opportunity to plan for a better solution.
02:10
A permanent solution could apply when your old equipment fails before you have the chance to replace it during your next maintenance window. In that case, you could install the new equipment you intended to deploy. But of course, this is subject to approval from your senior management.
02:24
There are times, however, of course, where, no matter how much you plan,
02:30
a new event will just pop up out of the blue.
02:32
But as previously mentioned, keep calm, and no matter what happens or how limited you feel your time may be, plan your next move and make the best choice carefully.
02:44
Part of network operations is to always think fast on your feet when such events arise.
02:52
But what if you solve the issue? What next? Conduct a failure analysis, asking questions such as: What was the severity and impact of the problem? Was it a recurring issue?
03:01
What caused it? Is there a problem with the current network architecture?
03:06
Is it faulty equipment? Or was it a mystery that requires further investigation and escalation?
03:13
All this information will help you prepare your reports and documentation, which should include your findings,
03:20
the solution or solutions to the problem and recommendations to prevent the same incident from happening again in the future.
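As a rough illustration of the failure analysis just described, the questions can be captured in a small record that renders a report. This is only a sketch: the class name, field names, and severity scale are assumptions made for the example, not part of the course material.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FailureAnalysis:
    """Hypothetical post-incident record; field names are illustrative."""
    summary: str
    severity: str    # assumed scale, e.g. "low", "medium", "high"
    impact: str      # systems, services, or users affected
    recurring: bool  # True if the same issue has been seen before
    cause: str       # e.g. "faulty equipment", "architecture", "unknown"
    solutions: List[str] = field(default_factory=list)
    recommendations: List[str] = field(default_factory=list)

    def needs_investigation(self) -> bool:
        # Assumption: an unknown cause is a "mystery" that should be escalated.
        return self.cause.strip().lower() == "unknown"

    def to_report(self) -> str:
        """Render the findings as a plain-text report."""
        lines = [
            f"Summary: {self.summary}",
            f"Severity: {self.severity}",
            f"Impact: {self.impact}",
            f"Recurring: {'yes' if self.recurring else 'no'}",
            f"Cause: {self.cause}",
            "Solutions: " + ("; ".join(self.solutions) or "none recorded"),
            "Recommendations: "
            + ("; ".join(self.recommendations) or "none recorded"),
        ]
        return "\n".join(lines)


analysis = FailureAnalysis(
    summary="Uplink flapping on a distribution switch",
    severity="high",
    impact="Intermittent loss of service for one office floor",
    recurring=True,
    cause="faulty equipment",
    solutions=["Replaced the failing module with a spare"],
    recommendations=["Keep a tested spare on site"],
)
print(analysis.to_report())
```

Keeping the findings, solutions, and recommendations in one structure makes it harder to file a report with one of them missing.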
03:31
But what if you encounter a mystery case? How do you handle this?
03:37
For starters, gather as much information about the event as you can.
03:40
So, what systems or services were affected, and for how long?
03:45
Check system logs and your monitoring system to identify patterns in the issue.
03:51
Did it just happen, or is it a recurring event?
03:53
Discuss your findings with the necessary parties, such as your team and/or third-party vendors, if needed. Solidify your findings and justifications for the event.
04:04
Whether the event was predicted or not, always write reports and document your findings. Include what solution or solutions were implemented and what your recommendations are to help you or your team with similar future incidents.
04:17
Your recommendations can include new tools, hardware, or software if you felt there were limitations that restricted you in your investigation.
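To make the "did it just happen, or is it a recurring event" check concrete, here is a minimal sketch that counts repeated messages per device in a list of log lines. The log format ("timestamp device message"), the device names, and the threshold are all assumptions for illustration; real syslog or NMS exports will need their own parsing.

```python
import re
from collections import Counter

# Assumed line format: "<timestamp> <device> <message>"
LOG_LINE = re.compile(r"^(\S+)\s+(\S+)\s+(.*)$")


def count_events(log_lines):
    """Count how often each (device, message) pair appears."""
    counts = Counter()
    for line in log_lines:
        match = LOG_LINE.match(line.strip())
        if match:
            _timestamp, device, message = match.groups()
            counts[(device, message)] += 1
    return counts


def recurring_events(log_lines, threshold=2):
    """Return the events seen at least `threshold` times."""
    return {
        event: seen
        for event, seen in count_events(log_lines).items()
        if seen >= threshold
    }


sample_logs = [
    "2024-01-05T02:10:00 sw-core-1 link down on port 12",
    "2024-01-06T02:11:30 sw-core-1 link down on port 12",
    "2024-01-06T09:00:00 fw-edge-1 high cpu",
]
for (device, message), seen in recurring_events(sample_logs).items():
    print(f"{device}: '{message}' seen {seen} times")
```

An event that shows up repeatedly, especially at similar times of day, is worth raising with your team or vendor as a pattern rather than a one-off.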
04:32
So, our first quiz: you encounter a mystery event.
04:36
Which of the following actions do you need to perform for further investigation of the issue?
04:53
And it's all of the options.
05:00
Now, on to how we can manage network changes.
05:03
So your network is live and running, but you notice improvements need to be made.
05:09
When should you decide to make changes in your operations?
05:14
You can do so when making routine changes, for example, installing software updates, usually due to bug fixes or feature and security improvements, or when making non-routine changes, such as replacing legacy equipment.
05:27
And only make changes when necessary, for example, when a device failure is detected and it has affected the quality of your services or the efficiency of your network operations.
05:38
Discuss with your team and raise your proposed action plan with senior management.
05:44
Remember, though, that all these changes need to be included in your maintenance plans, which require approval and acknowledgment before they are applied.
05:56
Routine changes are predictable events, so it's a little easier to prepare procedures for them and document the outcome of each change.
06:04
It is worth noting that any changes should always be tracked and documented.
06:09
You even have the option to automate routine processes. For example, you could enable remote access after office hours using only SSH (Secure Shell).
06:19
This gives you more time to focus on priority tasks and operations.
06:28
For non-routine changes, there are many considerations to be taken into account before proceeding with the change,
06:34
and you will definitely need clear justifications, such as: Is the change necessary?
06:41
Will this improve operations?
06:43
What is wrong with the current operations? Hence, you will need very good references, which can be gathered from your NMS, logs,
06:51
or even customer feedback, to prove why you feel the change is needed.
06:57
What resources are required? Are there additional costs? How long is the delivery time?
07:03
Plan how you can use your resources effectively, including your costs and schedule, to implement the change.
07:11
What are your activities for the change? Prepare a procedure plan and have someone else, usually your superior, review it for approval.
07:18
What is your rollback plan? There are times when your implementation may not work as expected,
07:25
so always have a backup plan or rollback to resume the previous working configuration.
07:31
Then document your findings of what happened or why the implementation failed.
07:40
What could go wrong?
07:41
What will be the backup plan if it doesn't work?
07:45
Do a risk assessment to discuss potential risks and impacts,
07:48
and then your action plans and how to solve the issues if they arise.
07:55
What will be affected?
07:56
List the systems, services and the end users that will be affected by the change.
08:01
Is downtime required? For how long? Is it acceptable to management and your end users?
08:07
If yes, prepare to notify affected end users of the downtime at least two weeks in advance, to avoid surprises and to give them time to plan what they may need to do during that temporary outage.
08:20
How will this be tested? Create a test plan that includes the procedures you will perform and how you will test them in an environment that matches, or is as comparable as possible to, your live network operations.
08:35
Also include what parameters you will use to determine whether the test is considered a success.
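The considerations for a non-routine change can be pulled together into a simple readiness check. The sketch below is only one way to model it: the class, its field names, and the checklist wording are assumptions for illustration, and the two-week notice period is taken from the guidance above.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import List, Optional

NOTICE_PERIOD = timedelta(weeks=2)  # "at least two weeks in advance"


@dataclass
class ChangeRequest:
    """Hypothetical non-routine change request; fields are illustrative."""
    justification: str
    rollback_plan: str
    test_plan: str
    success_criteria: List[str]
    affected: List[str]  # systems, services, and end users
    change_date: date
    downtime_required: bool = False
    approved: bool = False

    def notify_by(self) -> Optional[date]:
        """Latest date to notify end users, or None if no downtime."""
        if not self.downtime_required:
            return None
        return self.change_date - NOTICE_PERIOD

    def missing_items(self) -> List[str]:
        """List what still has to be prepared before the change."""
        missing = []
        if not self.justification.strip():
            missing.append("justification")
        if not self.rollback_plan.strip():
            missing.append("rollback plan")
        if not self.test_plan.strip():
            missing.append("test plan")
        if not self.success_criteria:
            missing.append("test success criteria")
        if not self.affected:
            missing.append("list of affected systems and users")
        if not self.approved:
            missing.append("approval")
        return missing


request = ChangeRequest(
    justification="Legacy router no longer receives security updates",
    rollback_plan="",
    test_plan="Replay production routing table in the lab",
    success_criteria=["All routes learned", "No packet loss in failover test"],
    affected=["Branch office WAN"],
    change_date=date(2024, 3, 15),
    downtime_required=True,
)
print("Notify end users by:", request.notify_by())
print("Still missing:", request.missing_items())
```

A change that still has anything in its missing list is not ready to be scheduled, however urgent it feels.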
08:43
So far, we've discussed how to handle network operations and how to manage network changes.
08:48
Let's move on to how we can improve network operations.
08:52
The aspects of this include improving your network architecture, that is, the design of a stable and robust network, and your operating procedures, that is, the way you handle and manage your operations as effectively and efficiently as possible.
09:09
When improving network architecture, avoid single points of failure and implement redundancies in equipment, links,
09:18
proper routing, et cetera.
09:20
Before making improvements, remember to keep it simple and keep it consistent. Do not overcomplicate things for yourself or your team.
09:28
And lastly,
09:30
think of the future: make the network scalable to support new implementations. For example, is your network IPv6 ready? Remember, technology nowadays is continuously changing, so always be prepared and read up on the latest updates.
09:46
When implementing redundancy, consider that failure may still occur. So plan for the possible failures that may arise and always monitor the condition of your systems, because redundancy does not guarantee 100% stability and performance.
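A quick calculation shows why redundancy improves availability without ever reaching 100%. The sketch below uses the standard formula for parallel components, 1 - (1 - a)^n, and assumes that the redundant units fail independently, which real networks often violate (shared power, shared software bugs). The 99% figure is an invented example value.

```python
def parallel_availability(single: float, copies: int) -> float:
    """Availability of `copies` redundant units, assuming independent failures.

    The group is down only when every unit is down at the same time,
    so availability = 1 - (1 - single) ** copies.
    """
    if not 0.0 <= single <= 1.0:
        raise ValueError("single must be between 0 and 1")
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return 1.0 - (1.0 - single) ** copies


# Example value only: each link is assumed to be up 99% of the time.
for copies in (1, 2, 3):
    combined = parallel_availability(0.99, copies)
    print(f"{copies} link(s): {combined:.6f}")
```

Each added unit shrinks the remaining downtime, but the result stays below 1, and a shared fault can take out every unit together. That is why monitoring and testing the redundancy still matter.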
10:01
Keep track of your redundancy configurations, as they can get complicated.
10:07
Hence the advice to keep it simple and consistent throughout your entire network.
10:11
Vulnerabilities are possible, such as software bugs, so always be ready to counter such possibilities.
10:20
As you plan for scalability,
10:22
consider your current services and future services. In other words, ensure your system is sufficient to support future services for maybe the next 3 to 5 years, depending on your organization. However, do not purchase equipment with features or high specs that you do not need.
10:41
For example, you would not want to purchase equipment used for data centers for a home-use environment.
10:48
Use the hierarchical network design concept, which consists of the core, distribution, and access layers. This, of course, depends on the size of your organization. However, this model is definitely a good reference to consider as you plan for scalability.
11:03
And plan your progress for the implementation. This has a little more to do with project management: the scope of work, implementation by phases (which is usually recommended), budget, schedule, resources, and so forth,
11:20
which all need to be taken into account.
11:24
Now, when improving operating procedures, consider the services delivered to end users, monitor your network and set priority alerts, be proactive in your repairs, plan well for network changes, and conduct audits on the processes used in your operations.
11:46
If you focus on improving services delivered to end users, consider the following: What are your services? For example, are users accessing your web or email servers? Are they using Internet or backhaul services? Who are your end users? Are they small and medium businesses or enterprise customers?
12:05
What components are required to provide the service?
12:07
For example, power, cables, routers, switches, firewalls, et cetera.
12:15
What happens when an issue occurs? Is all documentation reliable and up to date?
12:20
Are the current tools you have adequate?
12:26
Part of improving network operations includes monitoring your network.
12:31
This helps you ensure your systems are working as expected
12:35
in delivering quality services to your end users. Hence, consider checking the condition of end-to-end services and your network capacity from your NMS, and verify any abnormalities,
12:48
component failures, or slow user experiences. If redundancy exists, is it working? How often do you test the redundancy to ensure it's working?
12:58
And also prioritize your alerts.
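Prioritizing alerts can be as simple as sorting them by an agreed severity order. This is a minimal sketch: the severity names, their ranking, and the "customer-facing first" tie-breaker are assumptions for illustration, and your NMS will have its own levels.

```python
from dataclasses import dataclass
from typing import List

# Assumed severity order; lower rank is handled first.
SEVERITY_RANK = {"critical": 0, "major": 1, "minor": 2, "info": 3}


@dataclass
class Alert:
    device: str
    message: str
    severity: str
    customer_facing: bool = False


def prioritize(alerts: List[Alert]) -> List[Alert]:
    """Sort by severity, then put customer-facing alerts first."""
    return sorted(
        alerts,
        key=lambda a: (
            # Unknown severities sort after all known ones.
            SEVERITY_RANK.get(a.severity, len(SEVERITY_RANK)),
            not a.customer_facing,
        ),
    )


alerts = [
    Alert("sw-access-3", "fan speed warning", "minor"),
    Alert("rtr-core-1", "primary uplink down", "critical", customer_facing=True),
    Alert("fw-edge-1", "backup link down", "critical"),
    Alert("srv-mail-1", "disk 80% full", "major", customer_facing=True),
]
for alert in prioritize(alerts):
    print(f"[{alert.severity}] {alert.device}: {alert.message}")
```

Agreeing on the order ahead of time means nobody has to decide what matters most in the middle of an incident.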
13:03
Another factor that helps contribute to providing excellent network operation services to end users is to be proactive with repairs. So, for example, if there are any problems, schedule a time to fix them and adhere to proper change management procedures. Do not cause sudden outages while making adjustments.
13:22
Ask for approvals for the change, and once you're given the go signal, notify the correct people as early as possible before making the change.
13:31
Think about what could go wrong. Plan and test for solutions in the event that failure occurs.
13:41
An effective method of identifying whether improvements need to be made to your operations is to conduct audits. When auditing, consider what needs to be evaluated. For example, are configurations standardized throughout your entire network? Are the current standards considered to be at optimal efficiency with current
14:01
resources? Are
14:03
all redundant links and equipment working when a failure occurs in the primary system or link?
14:07
Have you conducted tests to ensure confidence in making proposed changes to live operations? And are all documentation and reports available and updated with the latest information for reference?
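One of the audit questions, whether configurations are standardized across the network, lends itself to a simple automated check. The sketch below compares each device's configuration text against a baseline of required lines. The baseline entries, device names, and configuration syntax are placeholders invented for the example and are not tied to any vendor.

```python
from typing import Dict, List


def audit_configs(
    baseline: List[str], configs: Dict[str, str]
) -> Dict[str, List[str]]:
    """Return, per device, the baseline lines missing from its config."""
    findings: Dict[str, List[str]] = {}
    for device, text in configs.items():
        present = {line.strip() for line in text.splitlines()}
        missing = [required for required in baseline if required not in present]
        if missing:
            findings[device] = missing
    return findings


# Placeholder baseline and configs; real ones would come from your backups.
baseline = ["ntp server 192.0.2.10", "logging host 192.0.2.20"]
configs = {
    "sw-access-1": "hostname sw-access-1\nntp server 192.0.2.10\nlogging host 192.0.2.20\n",
    "sw-access-2": "hostname sw-access-2\nntp server 192.0.2.10\n",
}
for device, missing in audit_configs(baseline, configs).items():
    print(f"{device} is missing: {', '.join(missing)}")
```

Devices that do not appear in the findings match the baseline, and anything that does appear becomes an item for the audit report.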
14:22
I'm sure many of you notice how repetitive these points are, but they are very important to consider and to put into practice.
14:30
In the long run, you will definitely understand how these factors could help you in your operations.
14:37
All right, next quiz.
14:39
Before implementing a new system into live operations, what would be the best assurance to justify that the system will most likely work?
15:00
Conduct tests in an environment similar to live operations. Always conduct tests before implementing anything new in your running network operations and services.
15:11
In summary, we discussed the best practices of handling network issues, both known and unknown or mystery events; how we can properly manage network changes,
15:22
whether routine or non-routine, which are best applied only when necessary; and how we can improve our network operations by enhancing network architecture and operating procedures.
15:37
And guess what?
15:39
We're at the end of module three, and at the end of this course.
15:43
I want to extend my thanks to everyone who attended this course, and I really hope you learned something from it that could help in your operations environment and work responsibilities. Finally, I wish everyone all the best in your network operations journey.

Network Operational Management

This course is designed to help network specialists understand the responsibilities and best practices involved with monitoring and managing network operations.

Instructed By

Sheane Jayne
Network Engineer
Instructor