Hi Stella,
the major metrics to provide are...
1. Volume of source data because each of the databases expands data by
a specific amount and most of them now also support varying levels of
compression. 5 to 8 TB means different sizes on different databases.
2. Number of rows in large tables.
3. Number of users.
As the size, complexity and number of users goes up
Oracle/DB2/Teradata/Sybase IQ come more into their own over MySQL and
SQL Server. I think you will find few examples of 5-8TB databases
running on SQL Server and not that many on windows/oracle. For
DB2/oracle they are commonplace.
For IQ there are almost none because you get compression in IQ......so
something that is 8TB disk in Oracle is about 800GB disk in IQ. IQ now
holds more of the top 10 records than any other database...amazingly it
is a very well kept secret of out friends at Sybase.
I think it is unrealistic to think that as a consulting provider to a
client you would be given the benefit of years of experience in using
these databases across many projects in short appends on a newsgroup.
Long term experience across a range of databases provides a set of hard
won skills which are of great value to clients should they want someone
to assist in the database decision.
You can ask the vendors. Each vendor produces detailed papers for their
database.
Each of MySQL, SQL Server, Oracle, DB2, Teradata, Sybase IQ have their
strenghts and weaknesses. No vendor is going to tell you anything about
their weaknesses and vendors (and biased consultants) spread plenty of
mis-information and out of date information about competitive products.
For SQL Server (we are on a MSFT newsgroup) MSFT has done an excellent
job of documenting project REAL.
http://www.microsoft.com/sql/solutions/bi/projectreal.mspx This is one of the most open presentations of a BI effort any of the
vendors has produced. MSFT are to be congratulated on their open-ness
on this one!!
My FAQs page discusses some of the considerations.
http://www.peternolan.com/FAQs/tabid/136/Default.aspx but it is only
intended to provide advice for smaller clients. Larger clients are
expected to have staff to advise on these decisions or pay for advice
on these decisions. I have been paid by clients to prove that the
database they wanted to use would work for what they wanted to do with
it.
Having said all that....
Many people consider what database to use as setp 1 of a BI effort. It
is one of the most ineffective places to start. It is far better to
start with what business the business is in and to design and build a
working prototype before deciding on what database to use.
It is now possible to build large scale prototypes and to have the the
ETL completely transportable from one database to the other with only
the cost of converting the table definitions.
Of course, none of the database vendors will tell anyone that it is
possible to migrate a DW between databases with trivial effort!! It is
not in their interests. And MSFT/Oracle have tied their ETL tools
tightly to the database so if you write your ETL in one of these two
ETL products you will live with the database forever.
So....It is now possible to postpone the 'which database' decision
until very late in the process, which also means it is possible to
postpone the database license payment until late in the process.
Indeed, in some clients we have built the large scale prototype on 2
competing databases so that we could really compare apples with apples.
No amount of 'competitive information' is equal to actually running two
databases side by side, latest release, with the vendors competing for
the business. The mix of HW/OS/RDBMS/DISK and versions of all these
makes it pretty much impossible to compare two solutions on
paper....though many try.
I recommend to my clients that if they are undecided about which
database then they are best advised to remain undecided until they
complete the development of the protoype and do a side by side
comparison to prove to themselves which one they feel is best for them.
It is not lost on my clients that the database vendors are far more
likely to offer incentives when the decision process is clear and
imminent and the weaknesses of their product have been exposed in large
scale prototype.... ;-)
Peter
www.peternolan.com