Confessions of a DB2 Geek

It’s finally arrived!  Confessions of a DB2 Geek is here to bring you all the latest facts, updates and insight into the world of DB2.

Our resident DB2 Geeks will be regularly sharing their thoughts on the latest trends in DB2. Click here to find out what our geeks are up to.

In our first installment we bring you – Pure Genius – Mainframe style data sharing comes to AIX.

Get the lowdown on data sharing! Download the podcast.

With the launch of pureScale at the end of 2009 IBM announced to the world that they were bringing mainframe style data sharing to the AIX platform – with other platforms to follow.  Given that z/OS is the gold standard for reliability this is an exciting prospect.

At Triton our resident DB2 Geeks have got a wealth of expertise in DB2 both on and off the mainframe.  In this podcast James Gill – a mainframe data sharing expert – chats to Clair Ross - a DB2 LUW expert – about what lessons can be learned from the mainframe data sharing world and how pureScale can bring this to the midrange market.

 

  • Share/Bookmark

No Comments | Filed under DB2, DB2 Geek

Using SQL Replication with DB2’s HADR

Take a look at this interesting new IBM presentation on SQL replication with HADR:


Find more videos like this on ChannelDB2

  • Share/Bookmark

No Comments | Filed under Availability & disaster recovery, DB2

Data Management presentations available on slideshare

We will now be posting all our presentations and articles online through slideshare.  Click here to take a look.

You can find recent presentations on data growth and high availability technologies for DB2.  look out for more content coming soon.

  • Share/Bookmark

No Comments | Filed under DB2

Insight from the sharp-end of IT

Take a look at IT management from a different perspective. James Cockayne, Principal DBA at Camelot Group gives us his views on key issues for DBAs, virtualization solutions and how a DBA can ensure they get that all important beauty sleep! – http://www.it-director.com/technology/data_mgmt/content.php?cid=11469

  • Share/Bookmark

1 Comment | Filed under Availability & disaster recovery

Paul Stoker chats to James Cockayne of EDS about some of the headaches of managing databases in a 24/7 environment.

Just how important is it for organisations to ensure they have a robust database availability & disaster recovery solution?

The database is always the lynchpin of an application.  Without the database, the application doesn’t function, and there’s a bunch of angry users wanting to know why (not to mention angry managers seeing money slipping down the drain).  More and more there is an expectation that the services we provide will always be available when the customer wants them, and as expectations get higher the tolerance for failure gets lower.  It is vital that we not only ensure the customer can access the service they want during the traditional ‘working day’, but increasingly all the way up to 24×7 availability.

 

What are some of the issues for database administrators looking after databases in a 24/7 environment?

DB2 UDB on distributed platforms offers a highly reliable database solution, but in the real world many different applications and scripts run against our databases which are hosted on various pieces of complicated hardware and problems will occur.  An outage during the day is bad enough, but at least the DBAs and other support teams are at work and ready to respond.  When supporting a 24×7 system a failure in the middle of the night can be, literally, a nightmare as the DBA has to get connected and conduct a diagnosis and fix as quickly as possible – after all, in a 24×7 system the middle of the night here in the UK is daytime for customers elsewhere in the world.  Often an issue will mean calling (and usually waking up) other support personnel from teams such as server support, storage, or networks for diagnosis or resolution – not to mention managers on the escalation list…

 

So how can those midnight calls best be avoided?

For both happier customers and support staff a backup standby server with an automated failover solution is often the answer.  This isn’t a panacea for all database woes – but in a well run and maintained production DB2 environment a sizable percentage of any unplanned downtime will be caused by hardware or third party software issues, so having the database automatically move onto another server can make a serious difference to the time it takes to get the applications up and running again. 

 

So, will end users still experience an outage with an automated failover system?

Depending on the exact solution, and the application the database supports, the time to get the production service available again could be anything from a matter of minutes if using features such as DB2’s HADR in conjunction with Automatic Client Reroute, up to an hour if relying solely on mechanisms such as HACMP and scripting to start up DB2 on a cold server and restart the applications.  In reality though, the actual detection of a problem and the subsequent time it takes to make a decision to fail over is usually much more than a matter of minutes for HADR. But compare this to potentially several hours of work to manually restore a database onto a standby server and reconfigure so the application can see it, and the benefit is clear – it’s always worth setting the management expectations in advance though: it’s automatic, not instant! Both of these are active-passive solutions with a powered up standby server sitting idly by ready to take over from the primary server if a problem occurs  

 

You mention HADR and HACMP as potential solutions, what are the main differences?

HADR is a warm standby solution and gives not only fast failover but also high availability while applying DB2 FixPaks, effectively meaning the only downtime to install maintenance is switching between primary and standby servers. However it needs twice as much disk space because your database requires both the primary and standby servers have their own copy of the database; a pure HACMP failover solution will use a cold standby and so can share one copy of the database on disk. Also, HADR is high availability at the database layer of the application stack, whilst HACMP is high availability at the Operating System level.

 

So, the database automatically fails-over to the standby server.  Great!  That means you can carry on your beauty sleep undisturbed, right?

Not necessarily, automatic failover isn’t the answer to all database problems – indeed if your database failover kicks into action the DBA and other support teams will still need to diagnose and fix the problem and unfortunately it’s still the case that things will go wrong which can’t be solved by failover (batch jobs will still error and require sorting out, user error will certainly still occur…) so this isn’t an end to phone calls in the middle of the night.  What automatic failover gives us is a faster response to getting the production systems available to the users again, and enables fault diagnosis and resolution to take place possibly at a more sociable hour – certainly at a more careful and considered pace.

 

Okay, so I understand the potential pitfalls with Active-Passive solutions, but is there an alternative that can address these issues and provide close to 100% availability?

An alternative would be an active-active solution where both the primary and standby servers are operational and data is maintained on both servers. Having an active-active solution can provide the ultimate in high availability for DB2 UDB on distributed platforms.  Whilst traditional thinking is based around minimising the time taken for the database to become available on a standby server after a serious problem on the primary, with active-active the standby server is already active – using a virtualisation product it’s possible for a failure to be invisible both to the users and even the application itself.  Couple this with the ability to perform maintenance on one server whilst the other(s) continue to be available to the application and it’s realistic to be aiming for 100% availability.  There are also other benefits to going for an active-active solution which are especially relevant at this time:  in failover solutions there is a standby server which has purchase and maintenance costs but is effectively only used in an emergency.  With active-active this server can be providing more of a return on that investment – by taking on some of the workload which would otherwise all be routed through the one ‘primary’ server performance can be improved.  In a system with a growing workload utilising this solution could even save the cost of upgrading the hardware to cope with demand.

 

Thanks for your time. Hopefully you won’t get called out tonight, in the early hours, to resolve any Production server problems…

 

  • Share/Bookmark

1 Comment | Filed under Availability & disaster recovery, DB2, Paul Stoker, Xkoto