jonathan
06-04-2009, 04:38 AM
Hello all,
Does anyone know of a single Grid system administration check-list or best-practices document that I can build on to make a definitive System Administration manual?
I am thinking about a standard operations schedule (daily, weekly, fortnightly, monthly, quarterly, annually) that would:
* List best defaults for installations (e.g. setting up dashboard alerts to email)
* List mandatory sys admin tasks for safe grid operation (e.g. checking for hotfixes, log rotation, disks checks etc)
* List recommended option best practices or activities (e.g. benchmarking, physical inspections etc).
I would like to apply some Paretto principles to it too by including preventative maintenance/checks for the most common or serious problems and their causes typically encountered on grids.
I would also like to add a section on hardware acceptance testing best practices and performance benchmarking.
All pointers, suggested list items and advice gratefully received
JD
Does anyone know of a single Grid system administration check-list or best-practices document that I can build on to make a definitive System Administration manual?
I am thinking about a standard operations schedule (daily, weekly, fortnightly, monthly, quarterly, annually) that would:
* List best defaults for installations (e.g. setting up dashboard alerts to email)
* List mandatory sys admin tasks for safe grid operation (e.g. checking for hotfixes, log rotation, disks checks etc)
* List recommended option best practices or activities (e.g. benchmarking, physical inspections etc).
I would like to apply some Paretto principles to it too by including preventative maintenance/checks for the most common or serious problems and their causes typically encountered on grids.
I would also like to add a section on hardware acceptance testing best practices and performance benchmarking.
All pointers, suggested list items and advice gratefully received
JD