Corporate Coaches & Psychologists

Superior human solutions for business

Insight Initiatives Pty Ltd

ACN 089 087 623

Sydney, NSW, Australia

 

Call (+61) (0)2 9918 4639

 

Home Services Products Training Articles About Contact


 


On this page

Questionnaire differences

Indicators of good design

Norm group size


Also read about...

Qualia | Personality

More ePsych News articles


 

 

 

 

 

 

 

How to Identify...

Well Designed Psychometric Profiling Tools for Business Use

There are a lot of tests and questionnaires - what are the differences?

Many tests and questionnaires are now available which purport to measure personality, values, motivation, behaviour or other human characteristics.  There is great variation in the quality of these tools.  The category includes only a small number of credible instruments with years of specialist research behind them, plus a great many ‘cheap and cheerful’ creations which are founded upon little or no research or specialist understanding. 

There are also several psychiatric tools.  These are in a different category and are not appropriate to use in the great majority of occupational contexts.

Unsuitable or ineffective tools will at best add no value - you will merely have wasted a little time and money.  At worst, when key personnel decisions are misled by them, they can be damaging to the point of incurring law suits.  Unfortunately, the differences are typically not obvious to the non-specialist.  


What are the clues which can indicate quality design?

1.  Long questionnaire + moderate report length

There is no magic – it is only possible to get as much out of a questionnaire as you put in.  Therefore, long or detailed text-based reports derived from short or simple questionnaires should be regarded with suspicion.  Such tools are very unlikely to be genuinely measuring everything (or anything) that they purport.

A good rule of thumb when assessing the personality of professional or executive-level staff, is that questionnaires should take at least 20 minutes for the fastest person to complete - and 25 to 40 minutes for most people.  Outputs should be in the form of profile charts or short text descriptions of quite narrow traits. 

2.  Data on scale 'reliability'

The reliability of a scale is a statistical concept which indicates the integrity, reproducibility or stability of a test result.  For example, if a person completes a questionnaire twice, their results should be very similar. 

This is an essential prerequisite for 'predictive validity' – using test results to predict actual on-the-job behaviour because no predictions can be made if the fundamental measures are fluctuating wildly.

(For more about statistics and examples of reliability and validity figures, look in the Interpretation Guide for Qualia.)

3.  Many scales are usually much more useful than a few

Whilst the theoretical Five-factor model of personality is mathematically and intellectually tidy, in most selection and development contexts such tools contribute little.  Most human resources specialists, psychologists and executive coaches find that at least 12 detailed scales are needed to begin adding value to selection decisions and staff development work. 

The ideal is about 24 to 33 scales, because this avoids pigeonholing or categorising people into overly generalised boxes.  It enables you to much better understand each person's individuality, and thereby fit them to specific jobs or understand their performance issues. 

If there are more than 33 scales there begins to be too much overlap between them, so several will be redundant.

4.  Scales must be properly ‘normed’

Personality questionnaire results have little or no meaning unless compared against what is ‘normal’ or 'typical' for a large and relevant sample of the wider population.  Therefore, results should always be plotted against a comparison group (norm group) broadly related to the role in question. 

For example, a tool should be able to tell you "Meredith describes herself as much less ambitious than is typical for most managers and professionals". 

By contrast, a test result presented as a stand alone 'score' is largely meaningless, and highly vulnerable to misinterpretation.

Important:  Statistically and practically, a norm group of 10,000 people is no better than a norm group of 1000 people.  Why?  Because differences will be only fractions of a percent, which is entirely invisible when plotted on a 10 or even 20 point scale.  Nevertheless, some test publishers sell the idea of large norm groups as conferring a special advantage.  They do not.  To suggest so is misleading and, perhaps, used to redirect attention from the more important aspects of design. 

Much more critical is the statistical construction of the tool you are considering, in other words, the care taken by the psychometricians when conducting the original research, design and product testing.

 

Help us to Buzz Up Yahoo - vote for this article!

« Back to Qualia     « Back to ePsych News     » Contact us now

 

Copyright © 2008 Insight Initiatives Pty Ltd
ABN 81 089 087 623