University of California at San Diego computer engineers have
come up with a proposal to change the way data centers and
clusters could be configured to give them
better scalability at a cheaper cost for the interconnect
speeds by altering the topology. They say they can use the
same hardware, and drop costs to one seventh of what current
examples are using.
"Large companies are putting together server farms of
tens of thousands of computers - even approaching 100-thousand,
and the big challenge is to interconnect all these computers so
that they can talk to each other as quickly as possible, without
incurring significant costs," said Amin Vahdat, a professor
of Computer Science and Engineering (CSE) in UC San Diego's
Jacobs School of Engineering. "We are proposing a new
topology for Ethernet data center connectivity." SIGCOMM
paper in PDF: A Scalable,
Commodity, Data Center Network Architecture
It looks like this is cheaper because it provides more
redundancy, you have 10 1Gbit switches more-or-less in parallel
instead of 1 way more expensive 10Gbit switch.
Agreed, they're not doing anything new at all. This is
actually a fairly common architecture in HA data centers.
(I guess that's what happens when you spend most of your time
in academia instead of the real world. :)
"We assume a cost of $7,000 for each 48-port GigE switch at
the edge..."
Dell sells their
6248 GbE switch, with additional 48 Gbps stacking module and
a 10 GbE uplink for $3,000. And before anyone whines that
it is a Dell and not a "real" switch like a Cisco, HP
or Juniper, you need to realize that GbE switches are now
commodity. These Dells support everything that the big boys
do. In many cases, they're made in the same factories.
I've installed 3 of these just recently, with 2 connected via
the stacking module (48 Gbps) and the third via 10 GbE
coax. I'm in the middle of monitoring traffic patterns
to determine the best layout, but I'm 99% certain that
workstations <--> servers and servers <--> servers,
but not a lot of workstations <--> workstations.
Servers are on the "core" switch, and each of them (5)
are connected via quad-port NICs.
My main question is do I assign separate NIC interfaces, or just
aggregate the quad into a single 4 Gbps bonded connection.
I'm also tempted to include the 2 on-board NICs in that
equation, making it a 6-port bundle per server.
The specific benefits of aggregated vs assigned will depend a bit
on the switch fabric's own max capacity and how it responds
to being saturated, (might be worth a little more testing and
architecture tweaking to handle burst cases), but as a general
rule aggregated is always better.
High End Computing Network Redesign
University of California at San Diego computer engineers have come up with a proposal to change the way data centers and clusters could be configured to give them better scalability at a cheaper cost for the interconnect speeds by altering the topology. They say they can use the same hardware, and drop costs to one seventh of what current examples are using.
"Large companies are putting together server farms of tens of thousands of computers - even approaching 100-thousand, and the big challenge is to interconnect all these computers so that they can talk to each other as quickly as possible, without incurring significant costs," said Amin Vahdat, a professor of Computer Science and Engineering (CSE) in UC San Diego's Jacobs School of Engineering. "We are proposing a new topology for Ethernet data center connectivity." SIGCOMM paper in PDF: A Scalable, Commodity, Data Center Network Architecture