Performance Analysis talk at SCALE10x
Last week I gave a talk at the Southern California Linux Expo (SCALE) titled “Performance Analysis: New Tools and Concepts from The Cloud”. There was a great turnout for my talk, which was videoed by...
View ArticleVisualizing Process Snapshots
In Visualizing the Cloud I showed processes and their parent-child hierarchy, across a cloud environment, exploring patterns at different scales. Here I’ll take this a little further and look at...
View ArticleVisualizing Process Execution
In Visualizing Process Snapshots I showed processes and their parent-child hierarchy over time, using snapshots of process information. This approach misses short-lived processes that occur between the...
View ArticleThe USE Method
A serious performance issue arises, and you suspect it’s caused by the server. What do you check first? Back when I was teaching operating system performance, I wanted a methodology my students could...
View ArticleThe USE Method: Solaris Performance Checklist
The USE Method provides a strategy for performing a complete a check of system health, identifying common bottlenecks and errors. For each system resource, metrics for utilization, saturation and...
View ArticleThe USE Method: Linux Performance Checklist
The USE Method provides a strategy for performing a complete a check of system health, identifying common bottlenecks and errors. For each system resource, metrics for utilization, saturation and...
View ArticleLinux Kernel Performance: Flame Graphs
To get the most out of your systems, you want detailed insight into what the operating system kernel is doing. A typical approach is to sample stack traces; however, the data collected can be time...
View ArticleSubsecond Offset Heat Maps
“Wow, that’s weird!”. My subsecond offset visualization type looked great, but others found it weird and unfamiliar. I developed it for inclusion in Joyent’s Cloud Analytics tool for the purposes of...
View Article10 Performance Wins
I work on weird and challenging performance issues in the cloud, often deep inside the operating system kernel. These have made for some interesting blog posts in the past, but there’s a lot more I...
View ArticleFISL13: The USE Method
In July, Bryan Cantrill, Deirdré Straughan and I spoke at FISL, one of the world’s largest open software conferences, in Porto Alegre, Brazil. I had a great time. My talk introduced the USE Method: a...
View ArticleActive Benchmarking
Benchmarking is often done badly: tools are run ad-hoc, without understanding what they are testing or checking that the results are valid. This can lead to poor architectural choices that haunt you...
View ArticleSurge 2012: Real-time in the real world
In September, I attended and spoke at the Surge’12 conference in Baltimore. I highly recommend it for anyone interested in performance. The theme with most talks was problems encountered at scale – and...
View ArticleUSENIX LISA 2010: Visualizations for Performance Analysis
My USENIX LISA talk from 2010 is now available on youtube, also embedded below. The title is Visualizations for Performance Analysis (and more), and showed how the full distribution of data could be...
View ArticleUSENIX LISA 2012: Performance Analysis Methodology
At USENIX LISA 2012, I gave a talk titled Performance Analysis Methodology. This covered ten performance analysis anti-methodologies and methodologies, including the USE Method. I wrote about these in...
View ArticleThe USE Method: SmartOS Performance Checklist
The USE Method provides a strategy for performing a complete check of system health, identifying common bottlenecks and errors. For each system resource, metrics for utilization, saturation and errors...
View ArticleRevealing Hidden Latency Patterns
Latency Heat Map Response time – or latency – is crucial to understand in detail, but many of the common presentations of this data hide important details and patterns. Latency heat maps are an...
View ArticleThe Greatest Tool that Never Worked: har
har is the Hardware Activity Reporter. I’ve never seen it work, but it did help me solve a crucial performance issue. I saw har’s output in an article written in 2001 by Frédéric Parienté, the tool’s...
View ArticleLinux Performance Analysis and Tools
At the Southern California Linux Expo earlier this year (SCaLE 11x) I presented a talk on Linux Performance Analysis and Tools. It’s a great conference, and I was happy to be back. My talk provided an...
View ArticleSystems Performance: Enterprise and the Cloud
Systems performance analysis is an important skill for all computer users, whether you’re trying to understand why your laptop is slow, or optimizing the performance of a large-scale production...
View ArticleDetecting Outliers
In computer performance, we’re especially concerned about latency outliers: very slow database queries, application requests, disk I/O, etc. The term “outlier” is subjective: there is no rigid...
View Article