Counting Servers is Easy, there are a lot of other things that are much harder

James Hamilton has a post saying that is hard to count servers.

At the Microsoft World-Wide Partners Conference, Microsoft CEO Steve Ballmer announced that “We have something over a million servers in our data center infrastructure. Google is bigger than we are. Amazon is a little bit smaller. You get Yahoo! and Facebook, and then everybody else is 100,000 units probably or less.

That’s a surprising data point for a variety of reasons. The most surprising is that the data point was released at all. Just about nobody at the top of the server world chooses to boast with the server count data point. Partly because it’s not all that useful a number but mostly because a single data point is open to a lot of misinterpretation by even skilled industry observers. Basically, it’s pretty hard to see the value of talking about server counts and it is very easy to see the many negative implications that follow from such a number

What is hard is figuring out how many cores these servers have.  What is the age of the servers?  Oldest is 4 years.  Or 3.  What is the rate of adding new data center capacity and how does that relate to overall cores and storage increasing?

The one advantage Microsoft has in making a statement on server count is the companies will not speak up what theirs is.

The first question when thinking about this number is where does the comparative data actually come from?  I know for sure that Amazon has never released server count data. Google hasn’t either although estimates of their server footprint abound. Interestingly the estimates of Google server counts 5 years ago was 1,000,000 servers whereas current estimates have them only in the 900k to 1m range.

We'll see if others speak up on server count or not.  

The US census for years has conducted a study of manufacturing capacity for years.

 

Quarterly Survey of Plant Capacity Utilization (QPC)

The Survey of Plant Capacity Utilization provides statistics on the rates of capacity utilization for the U.S. manufacturing and publishing sectors.

  • The Federal Reserve Board (FRB) and The Department of Defense (DOD) co-fund the survey.
  • The survey collects data on actual, full, and emergency production levels.
  • Data are obtained from manufacturing and publishing establishments by means of a mailed questionnaire.
  • Respondents are asked to report actual production, an estimate of their full production capability, and an estimate of their national emergency production.
  • From these reported values, full and emergency utilization rates are calculated.
  • The survey produces full and emergency utilization rates for the manufacturing and publishing sectors defined by the North American Industrial Classification System (NAICS).
  • Final utilization rates are based on information collected from survey respondents.

Wouldn't it be useful for the FRB and DOD to understand data center capacity and utilization?  It is hard to assess, but that doesn't mean it shouldn't be done.

Something that is Missing from Facebook's Carbon Data Center Disclosure, an int'l colocation presence

I've been staring at the Facebook Carbon Disclosure for over two weeks.  You can see the 2012 numbers here.

NewImage

One of the things that I asked Facebook "where are the international colocation centers?"  Facebook doesn't have any.  There are POPs which are small and are counted in the office space carbon footprint.

It is pretty amazing that in 2012 the East Coast colocation presence was as large as Prineville and Forest City combined. In 2012 there was almost no data center capacity in Europe so content was a run back to the East Coast.  West Coast was taking care of Asia Pacific.

It will be interesting to see what Facebook releases for 2013.  My one request is it will not take 6 months to release this information in June 2014 for the 2013 disclosure.

How many funny things are done in data center because risk aversion is a best practice?

Outages are career killers in data centers and IT.  This leads to a risk aversion behavior that becomes a best practice.

Risk aversion

From Wikipedia, the free encyclopedia
 
 

Risk aversion is a concept in psychologyeconomics, and finance, based on the behavior of humans (especially consumers and investors) while exposed to uncertainty to attempt to reduce that uncertainty.

But, Risk Aversion can lead to an obsession to avoid doing things.  A funny example is Ben Stiller in Along Came Polly.

Ben Stillar's risk aversion is resolved when he enters data about his safe ex-wife and the risky Polly.

Reuben is torn between the free-spirited Polly and the safe and familiar Lisa. To solve this issue he enters information about Polly and Lisa into a computer insurance program which measures risk. The computer tells him that, despite his numerous blunders with her, Polly is the best choice for him.

It is too common a best practice to manage risk by avoiding the path that may have risk.  Risk leads to an outage.  An outage leads to job loss.  Don't do those things that increase risk.

Almost all data center innovators learn to live with risk.  Risk is everywhere.  But, risk aversion can still exist in pockets of an organization when one individual finds comfort in steering clear of all risks they identify.  It's too bad you can't enter information in about their situation and given them the best choice.

Aren't you glad you haven't been hit with a 40% Energy Surcharge like Western Texas Industries?

WSJ has an article on the problem the power transmission system in Western Texas being strained by the higher use of Energy drilling and pumping that is increasing the costs for many industries in areas.

While the largely rural region has enough power plants to supply the growing demand for electricity, the high-voltage transmission network hasn't kept pace. Beginning last summer, a shortage of transmission lines in some areas meant that grid operators couldn't automatically send the cheapest power to customers, but had to turn to more expensive power plants elsewhere in the state, where there was enough transmission capacity. Those higher costs were passed on as surcharges to many large customers.

Here are descriptions of some of the pain.

That isn't good news for executives at Tower Extrusions Ltd., which makes aluminum products like stadium seats and storm gutters in Olney, Texas, about 100 miles west of Fort Worth. The company says its power bills climbed 40% last year.

"The congestion charges are putting me at a huge disadvantage, compared to my competitors near Dallas or in other parts of the state," said Mark McClelland, Tower's general manager.

Even oil and gas companies are being hit by the charges. Kinder Morgan Inc.,KMI -0.84% which produces oil in the Permian Basin, said it had to pay as much as $400,000 in congestion costs on a single day in 2012.

Apache Corp., one of the Permian Basin's top oil producers, said its average costs in the area this year are running about 15% higher, largely due to the power-line congestion costs.

Ouch.  Could imagine if you ran a data center in this area?  There are probably some data center operators who are being hit by these costs.

Digital Realty expresses need for Hierarchical Design in DCIM to organize the data

Before David Schirmacher joined Digital Realty trust we would regularly have chats that could easily last hours.  I don't get a chance to chat with David as much now that he is Sr VP of Data Center Operations, but we still connect in many ideas.  Digital Realty has a paper it released on DCIM.  And a couple of things reminded me how we think the same.

NewImage

And David has a hierarchy which i would have drawn from the bottom up, but David has drawn from top down.

NewImage

Here is more information from the press release.

Digital Realty Addresses Challenges of Establishing Successful DCIM Platform
 
White paper explores the challenges of DCIM platforms and provides a comparative outlook on Digital Realty’s newly launched DCIM solution 
 
San Francisco, CA – July 23, 2013 – Digital Realty Trust, Inc. (NYSE: DLR), a leading global provider of data center solutions, today announced the release of a white paper titled, “Real-Time Monitoring for Data Centers: Comprehensive DCIM Solution Creates Connectivity-Rich Environment.” The paper, authored by David Schirmacher, Senior Vice President of Portfolio Operations for Digital Realty, explores the challenges inherent in establishing a successful data center infrastructure management (DCIM) platform and introduces EnVision, Digital Realty’s recently launched DCIM solution.
Historically, vendors have approached DCIM as a hardware problem by implementing specialized equipment in an attempt to achieve a real-time monitoring and management platform for the interdependent systems across IT and infrastructure. However, the challenge with today’s DCIM platforms isn’t with the hardware, but rather it’s with the data that’s being managed, monitored and analyzed. In short, today’s DCIM solutions need to provide a significantly more comprehensive view of all of the resources within a data center – from the mechanical, electrical and plumbing systems that form the backbone of a facility’s infrastructure to the servers and racks that compose the heart of the IT setup.
The paper explains how a data-driven, connected view of a data center enables data center operators to realize the capacity they need in order to help their firms effectively grow their businesses and to ensure their data centers are yielding optimal results. In particular, with EnVision, Digital Realty’s DCIM solution, the company’s customers gain increased visibility into data center operations, the ability to analyze data in a manner that is digestible and actionable, a user interface and data displays/reports that are tailored to data center operators, and access to historical as well as predictive data.
Interested in sharing some insights from the white paper with your Twitter followers? Check out our list of interesting facts pulled from the paper:
  • #DCIM is an emerging form of #datacenter mngt that bridges gap between traditional facilities systems#IT systems http://ow.ly/neWSD
  • #DCIM solutions must allow users to view info about the #datacenter in a truly connected sense http://ow.ly/neWSD
  • The issue at the core of the #DCIM puzzle is stranded data, per @drdatacenters http://ow.ly/neWSD
  • The focus shifts from displaying data to managing data when looking at #DCIM from an operator’s perspectivehttp://ow.ly/neWSD
  • #DCIM is not a hardware problem, it is a data problem” per @drdatacenters http://ow.ly/neWSD
  • A typical #datacenter, might have 10,000 data points which can turn into billions of #datapoints a year http://ow.ly/neWSD#DCIM
  • A typical #datacenter, if it is fully instrumented, might have 5,000 or 10,000 #datapoints http://ow.ly/neWSD #DCIM
  • @drdatacenters builds a comprehensive #DCIM solution with real-time monitoring for #datacenters http://ow.ly/neWSD
  • Key to a comprehensive #DCIM solution is recognizing the breadth & diversity of the available informationhttp://ow.ly/neWSD
  • “Real-Time Monitoring for #DataCenters: Comp #DCIM Solution Creates Connectivity-Rich Enviro http://ow.ly/neWSD via@drdatacenters
 
To view the full paper, click here: http://ow.ly/neWSD