Why Executive Hiring Decisions Go Wrong

The most common source of executive hiring failure is not a bad candidate — it is a good candidate evaluated inconsistently. Different interviewers assess different dimensions. The CEO's impression from the first conversation anchors the entire evaluation. The candidate who tells the best story beats the candidate who has the best track record. And the hiring decision is made in a debrief where people discuss their feelings about the candidate rather than systematically comparing evidence.

The Executive Candidate Scorecard is designed to prevent this outcome. It standardises what is assessed, sequences the evaluation in a way that surfaces the most relevant information, and creates a common language for the debrief conversation that separates evidence from impression.

The Five Evaluation Dimensions

1. Stage Fit (25 points): Has the candidate operated at comparable scale, stage, and organisational complexity to what this role requires?

2. Functional Excellence (25 points): Does the candidate have demonstrated, specific results in the functional domain this role owns?

3. Leadership Quality (20 points): How does the candidate build, develop, and retain teams? What is their track record as a people leader?

4. Commercial Orientation (15 points): Does the candidate think like a business leader, or purely as a functional specialist?

5. Cultural Alignment (15 points): Do the candidate's values, working style, and operating preferences match the company's current environment?

Scoring Stage Fit (25 points)

Stage fit is the dimension most frequently ignored and most consequential to search outcome. A candidate who has operated at the right stage scores 20–25 points. A candidate who has operated one stage above (e.g., Series D experience for a Series B role) scores 10–15 — they may be overqualified in ways that create problems. A candidate who has operated one stage below scores 5–10 — they may grow into the role but carry execution risk.

Stage fit is assessed by asking the candidate to describe their operating environment specifically: how large was the team they joined, how large did they grow it, what infrastructure existed when they joined and what did they build, and what was the ARR or scale of the business at the start and end of their tenure. Vague answers to these questions indicate either misrepresentation or that the candidate was not the primary architect of the outcomes they are describing.

Scoring Functional Excellence (25 points)

Functional excellence is assessed through specific result narratives — not career highlights, but specific situations where the candidate owned a measurable outcome and can describe exactly what they did and why it worked. The question structure that produces the most useful responses: "Tell me about the most significant thing you built or changed in your last role — specifically what it was, what it looked like before and after, and what you personally did to produce that outcome."

Candidates who score 20–25 points on functional excellence can answer this question with specificity that the interviewer can probe and verify. Candidates who score below 15 produce narrative that is either too general, attributes the outcome to the team rather than to specific decisions they made, or cannot withstand follow-up questioning.

Scoring Leadership Quality (20 points)

Leadership quality is best assessed through reference conversations rather than interviews — because candidates are reliably unreliable reporters of their own leadership behaviour. The reference question that produces the most useful information: "How did [candidate] handle a situation where a team member was underperforming? Walk me through specifically what they did."

References who describe concrete, specific interventions — a direct conversation, a performance plan, a role change — are describing a manager who addresses performance directly. References who describe the candidate as "always supportive" and "great with people" without specific examples are frequently describing someone who avoids difficult management conversations. The scorecard weights reference quality alongside interview performance for this dimension.

Using the Scorecard in Debrief

The debrief conversation should be structured as a scorecard review, not a free-form discussion. Each interviewer presents their scores and the specific evidence behind them before anyone else responds. This prevents anchoring — the first person to speak in an unstructured debrief disproportionately influences the group's assessment. After all scores are presented, the group identifies where assessments diverged and specifically what evidence each person is working from. Divergence is not a problem — it is information. It identifies either that different interviewers saw different things, or that the evidence is ambiguous and additional information is required before a decision is made.

"41 days. A $275K search. Two firms failed in 60+ days. That's not luck — that's a different system."

— Majhi Group case study. Read the full case study →