Creating a better operations environment by thinking abstractly, using agency

I am watching Gabe Newell again in his talk to LBJ school. I owe much of my career change as if it wasn't for Gabe I would not have left Apple to go to Microsoft in 1992. Now some of you may say that if I had stayed at Apple in 1992 i would be better off than going to Microsoft, but the chances of my staying at Apple after 1992 were slim.

And one of the nuggets is how does Valve make decisions on what to do to make better games. at the 28m mark is where Gabe shares Valves insight.

Below is written explanation.

This idea is powerful, because it is a good abstraction. i plan on using this.

Economics is really human behavior/psychology dressed up in fancy technical language. A lot of it is actually fairly common-sense, once you get past the language.

Here, specifically, “agency costs” refers to the principal-agent problem, aka, the agency problem.

In common terms, the principal-agent problem is, how do you (the principal) get someone (the agent) to do something that you want, instead of something that they want.

So if you (the principal) own a business and hire an employee (agent), how do you get the employee to do things to advance your company’s interests rather than their own? For example, you can try to reward someone with power and responsibility to motivate them to do well (that’s in your interest), but if you structure it wrong, they can play the system so that they look good and get the reward while doing things that hurt the company. An example of this would be the bonuses paid to bankers—in theory, they are supposed to reward people for doing good for the company and align their interests with that of ours, but because of the messed-up way performance is measured (and also the lack of a disincentive to not do bad things), it just ends up encouraging people to do shady and risky things that ultimately bankrupt the company while they walk away with huge bonuses.

The agency problem is everywhere. The classic textbook example is of the real estate agent. In theory, by giving them a percentage commission on the sale, you are encouraging them to get as good of a price as possible since the better the price, the better their commission. But after a certain point, they’ll decide that it’s no longer worth spending an extra month of their time to get a small percent of a somewhat better selling price, at which point, their interests (in getting the sale over with, because they have better things to do) supersede your interest (in getting the best possible price).

And it’s in politics, too, where politicians are (supposed to be) the agents of citizens. I don’t think I need to explain this one—it is clearly one area where the agency problem is... quite severe.

Anyway, a good manager is one that’s able to identify and analyze the agency problem and find a good solution to it (there usually isn’t a perfect solution—you try to find the solution with the best balance of upside vs. downside). And many people don’t have a good grasp of this (again, look at the BS that goes on in the financial markets—the principals are almost always getting screwed).
— https://www.reddit.com/r/Steam/comments/2zxcyy/gabe_newell_responds_to_email_asking_about/

The Power of Images and Apple Workflow, communicating status

I have been meaning to play with Apple Workflow. https://workflow.is/

“You’re in control. For the first time.

Build your own workflows with a simple drag-and-drop interface. Mix & match hundreds of actions to create quick shortcuts, manage your media, share content, and much more.

It’s like making your own apps. But a million times easier.”

Apple bought the company earlier this year and I have been meaning to spend time getting my head wrapped about using it. 

My wife is traveling and I wanted to let her know making breakfast and the dog (Bea) is in the kitchen with me.  Below is gif created so fast it was maybe 15 seconds. 

Taking three pictures seems like the right amount to show information and communicate so much. 

After I shared this gif, I cut and pasted and sent it to two of my technical friends I met with earlier in the week who know i like to cook, have been to my house, and know I have a dog.

What was little surprise is one friend asked what the red thing is in the pan with the eggs. It is a temperature sensor for telling you when the eggs are soft, medium, hard. This image gave the status of cooking the eggs. Then my head started spinning what could be done in communicating operations with three pictures. 

Time to experiment with others using this feature. 

Yes my dog did get some of the bacon. :-) 

IMG_0163.GIF

AWS goes with Green Data Center in Middle East (Bahrain) in 2019

The AWS blog announces the Middle East AWS instance coming in 2019. https://aws.amazon.com/blogs/aws/in-the-works-aws-region-in-the-middle-east/ 

“Middle East Region by Early 2019

Today, I am happy to announce that we will be opening an AWS Region in the Middle East by early 2019. The new Region will be based in Bahrain, will be comprised of three Availability Zones at launch, and will give AWS customers and partners the ability to run their workloads and store their data in the Middle East.”

What is not in the official blog post is the green data center effort and thankfully Werner Vogel posted those details here.  http://www.allthingsdistributed.com/2017/09/aws-region-middle-east.html

”In addition to infrastructure, offices, and jobs another investment AWS is making for its customers in the Middle East, and around the world is to run our business in the most environmentally friendly way. One of the important criteria in launching this AWS Region is the opportunity to power it with renewable energy. We chose Bahrain in part due to the country's focus on executing renewable energy goals and its readiness to construct a new solar power facility to meet our power needs. I'm pleased to announce that the Bahrain Energy and Water Authority (EWA) will construct a solar farm that will supply renewable energy to power this infrastructure Region. EWA expects to bring the 100 MW solar farm online in 2019, making it the country's first utility-scale renewable energy project.” 

AWS joins the rest of the big boys and discusses environmental criteria as part of its data center site selection.

And you can guess that others are feeling the pressure.  It wasn’t too long ago that having a green data center was only being discussed by Google. Apple, Facebook, Microsoft, and AWS all discuss the environmental impact of its data centers. 

Two different ways of planning to attend a conference

Here is a post https://datacenterfrontier.com/how-to-get-the-most-out-of-data-center-conferences/?utm_source=twitter&utm_medium=social&utm_campaign=SocialWarfare by Bill Kleyman on “How to get the most out of a Data Center Conference”

There is some good advice in it - check out the schedule, plan logistics, know the schedule, etc.  Out of the long post here is the closing.

”Finally, just make sure you enjoy yourself and make some new friends. Honestly, that’s the best part of the conference – learning something new and meeting up with dear friends and industry colleagues.” 

After reading the post I found that there is different way I use.

The perspective I use when attending a conference is a networking exercise. Most of you think of networking like social networking. I am referring to networking like it is an engineering exercise. As an engineer what is the current state of my network and how can I use the conference to improve my network.  

#1 is network with the conference staff. They can help you find people and get insight that is hard to get without a face-to-face conversation.  I always do this at DCD and 7x24 Exchange.

#2 are any of my friends going to the conference and what are their plans. Friends expand your ability to network at an event. 

#3 identify the particular people you are looking for at the conference. Those connections you want to reinforce and the new ones you want to make.

At the end of the conference if you have improved your network then it is a success.

Google has a mindset perspective from its early days giving it an advantage over many

In 1998, Google had a $100k check from Andy Becholsheim. In 1998 you could buy between 5-15 Compaq Servers that were used for web content. To make a high Availability system you would have a hot spare which could mean you have 1/2 the available resources. Google took the path that few have taken back then to use consumer components.

IMG_0160.JPG

Above is the 1st Google Servers. The first iteration of Google production servers was built with inexpensive hardware and was designed to be very fault-tolerant.

In 2013, Google published it Datacenters as a Computer paper. http://www.morganclaypool.com/doi/pdf/10.2200/S00516ED2V01Y201306CAC024

A key part of this paper is discussion of hardware failure. 

“1.6.6 HANDLING FAILURES

The sheer scale of WSCs requires that Internet services software tolerate relatively high component fault rates. Disk drives, for example, can exhibit annualized failure rates higher than 4% [123, 137]. Di erent deployments have reported between 1.2 and 16 average server-level restarts per year. With such high component failure rates, an application running across thousands of machines may need to react to failure conditions on an hourly basis. We expand on this topic further on Chapter 2, which describes the application domain, and Chapter 7, which deals with fault statistics.”

Google has come a long ways from using inexpensive hardware, but what has been carried forward is how to deal with failures. 

Some may think 2 nodes in a system are required for high availability, but the smart ones know that you need 3 nodes and really want 5 nodes in the system.