During the last few months, on a recurrent fashion basis, there has been a topic surfacing every now and then as it hurts our business: planned downtimes.
This said, all global companies, working round the clock, with a presence on all continents, all working on a single global system/infrastructure will face the same problem.
So how do you manage the planned maintenances which, obviously, will impact the daily activities of your workmates, as they do not understand why, in 2010, you still have to stop the systems to carry a maintenance.
Even today, IT/IS is seen as a cost center, barely, as a profit center. In this case, this becomes even more true as it impacts the productivity of the enterprise. Still you need to ensure the consistency of your systems, of your data, do some housekeeping in order to ensure proper performance and sometimes you’ll even have to apply some patches.
So, in the complex landscape that are so typical to the SAP world, how would you manage a proper lifecycle management of your applications and infrastructure without disturbing the business ?
People are able to organize their activities around a downtime, provided they know when it will happen, with enough lead time. A month in advance would be barely enough to allow the proper organization of everyone around the globe. You have to take into account that this might not just affect your colleagues but also all your business partners (customers and suppliers).
You should look forward to publishing a quarterly/yearly calendar with all major activities. Make it online, accessible to everyone so anyone, who has doubts about the planning, can refer to it.
You may also use some marketing and explain how the major operations (e.g: upgrades, support packages) will benefit to their activities.
During the maintenance, you may provide a status to the users to let them know which applications are available. Social Media has great tools for that (blogs, micro-blogging, status pages).
And in order to avoid anyone forgetting there’s a downtime next week, do not hesitate to send a friendly reminder a month and/or a week before.
For all your communications, you will need to target the right audience and have your direct network transmit the information to their teams.
Efficient management of planned maintenances is barely a question of technology. Of course it helps, and we will discuss it later, but it only comes as a support.
In order to enhance your maintenances and minimize their length, have a detailed plan of the activities, ensuring that all the dependencies are fulfilled, and that the required skills are available.
After the downtime, review the activities and draft your lessons learned: What should be avoided next time. Use this in your plans for the future.
Periodically, review the last months maintenances. Review the planning and see what can be improved: bundling the activities differently, testing more carefully some items in the test environments before applying them into your production. Among the tasks that have been planned, see which could be planned online with limited impact on the systems/users.
Make sure you manage core and sensitive components differently from the others. While a longer downtime of your intranet might be acceptable, it would harldy be the case of your ERP system. In order to manage this, for each task and system you need to define the criticity and the priority. You will end up building a matrix defining what should be done first and what can be postponed.
Define a rollback plan. Worst thing is to be taken aback. For each task, make sure you identify the risks and your rollback plan. The latter should be planned the same way you would do for any other task (priority, duration, dependencies, resources).
Technology can help you lots in limiting the impact of your maintenance on your business, and you should leverage it.
High availability will be one of your toys to pet. Make use of it. You have invested lots for your DRP plan. But HA can help you otherwise. If you need to carry a maintenance on your hardware for example, you might just switch to your mirror infrastructure and bring the applications back there while you’re working on your primary site.
This is also a good opportunity to test your HA infrastructure and potentially your DRP process.
Virtualization has matured over the years and is now able to provide a high degree of flexibility.It creates a greater abstraction between the different layers. Adaptive computing lets you rebalance your resources between components. As a preparation task for your maintenance, you might look forward making temporary changes in your configuration to avoid a complete disruption of the service. Keep in mind that these changes (and reverting them) need to be supported by a detailed documentation.
Monitoring: Planned downtimes are great but they need to be supported by a rationale which will help you define which activities need to be planned. A proactive monitoring of your systems should allow you not only to get the proper alerts but also to identify the trends and thus identify the key actions that need to be undertaken.
Planned maintenance could easily shift from a pure technical topic into a political one whenever it has an impact on the business. IT, in its state of art, is designed to service the business. Thus, make use of creativity and fine analysis in order to provide the best of breed service to your customers.