Author: Jim Cohen

Jim has over two decades of experience focusing on identifying needs and delivering solutions. He is a strong believer in the power of bringing the right people and processes together with Avantix to deliver outcomes that work. “It is all about realising joint value for all parties in the engagement”. Contact him at

…said the Analyst to the EDRMS

Well, of course it does.

When considering a document scanning project, there are a plethora of technical settings which need to be examined.  I am constantly asked what impact scan resolution has on the file size and quality. For the purposes of this article I will focus exclusively on these two aspects and address other considerations in subsequent articles.  (The next article reviews what impact compression has on the image quality).

When scanning files, there is often a play-off between getting a document scanned at the highest possible resolution to provide the best visual quality, yet keeping the file size manageable.

File Size

Firstly, let’s start off by looking at how file size impacts usability.  To put it in unscientific terms, the faster an image appears on the screen – the better and in this context, smaller files open faster than larger files.  Conversely, frustration levels soar, if a user has to wait tens of seconds before an image appears (and even longer for larger files).  This problem may be exacerbated if a user is accessing a hosted solution rather than accessing files on their own computer.

Image Quality

On the other side of this play-off is; image quality.  The rule of thumb is: The higher the resolution – the better the quality. (In reality though, there are a number of examples where this is not so, but this too will be addressed in a subsequent article).

The State Archives lists in its recommendations to have archive scanning performed at 600 PPI (Pixels per inch).  Note: this is a recommendation and not a standard.  The guidelines go on to suggest the resolution can be adjusted to ensure the image is “fit for purpose”.  This suggests the resolution can be adjusted to the appropriate level for the particular document type and circumstances)

The best way to illustrate the effect resolution has on Image Size, when document scanning, is via an example.  I have taken a typical single A4 page of content, scanned it at varying resolutions (both Black & White and Colour). The relative sizes of the documents are listed in the table below:

Resolution (PPI)Uncompressed
Black and White Size (KB)
Colour Size (KB)
20047411 312
3001 06625 380
4001 89245 162
6004 257101 597

This clearly demonstrates that file size almost doubles every time the resolution increases by 100 PPI.  A typical multi-page PDF document consisting of 30 – 40 double sided pages therefore varies quite significantly in size (even when compression is factored in) when comparing a low, to a high resolution scan.  Not only will accessing a large file present frustration, you may well have the IT department up in arms over significant storage space requirements and network traffic bottlenecks.

Output Quality

Then next aspect of the document scanning process to review is the quality of the output and compare differences.  I have taken a screen shot of the same snippet of the document for the resolutions: 200, 300 and 600 PPI respectively (Don’t be concerned about the content.  The snippets of each document are merely to demonstrate the relative quality of result).  I have specifically used a document containing a pattern as it is where the patterns intersect where changing quality is best observed.

Resolution (PPI)B&W ImageColour Image
200200PPI B&W Sample Image200PPI Colour Sample Image
300300PPI B&W Sample Image300PPI Colour Sample Image
600600PPI B&W Sample Image600PPI Colour Sample Image

From the samples, we can see that a high resolution scan creates a crisper clearer image.  This difference is best noted between 200 PPI and the others, but  is less obvious to the naked eye between 300PPI and 600PPI.  On face value, it seems that the slightly higher quality we get with the 600 PPI image, may not necessarily be sufficient to justify creating files which are more than four times the size of the 300 PPI image.  For colour images the impact is amplified.

Fit For Purpose

Now we know the impact resolution has on file size and image quality, the next action item is to define how “Fit for Purpose” applies to your document scanning project. [Refer to page 6 of the Digitisation Disposal Policy –  Queensland State Archives]

The best place to start answering this question is to examine the reason for initiating a document scanning project and the types of records involved.  This has the greatest impact on resolution settings.  If for example, you are bulk scanning financial documents, (Invoices etc.) which only have a retention period of 7 years with no real requirement above being a legible representation of the original, then scanning at 200 – 300 PPI Black & White, may well be sufficient.  This produces a usable image where the content can be read with confidence.  If the same images are intended to be OCR’d for data capture, then you would not go below 300 PPI as going under that would negatively impact the result.

If however, you are imaging a legal document (say), where the expectation is for the image to be as close a representation to the original as possible and the smallest detail is clearly visible, it follows that higher resolution colour may be needed. Even still, you would have to think that 600 PPI would be overkill.  An alternative approach would be to step up to 400 PPI, if there are compliance concerns regarding 300 PPI.

There is a concept called “Point of Diminishing Returns”. There comes a point with resolution where the higher the resolution scans, only makes a marginal difference to quality.  For the example used above, if the document had been scanned to 1200 PPI colour, the increased quality would be minimal but the price paid for file size would be dramatic. Note:  A situation where higher resolutions do make a difference is when scaling up the image, such as a photographic negative that is to be enlarged.  For this article, I am focussing on 1:1 scale document scanning.

Given that each project requires specific considerations around file size and resolution, it is difficult to make hard and fast recommendations to cover all scenarios. This article rather highlights the factors which need to be considered when document scanning.  More often than not, we get asked to perform bulk scanning on documents at 300 PPI (either B&W or Auto-Colour) as this provides a good balance between the size and the quality of output.

Our only Competitive Advantage is to learn faster than our competitors

Yes, for those of you who recognise the quote, I borrowed this title from Arie de Geus (Strategist – Shell Oil) and while it may be over 20 years old, the sentiment applies more now than ever. With a plethora of agile tools available to business, the lead time for any competitor to match or better New Product Design has reduced from what was traditionally years to mere weeks. So in reality to gain competitive advantage through innovation, all that sets us apart from any competitor is for us to learn faster and in turn, implement products, processes or solutions which reflect what we have learned.

This being the case, it stands to reason that a starting point for innovation is to first have comprehensive knowledge of your business’ current position before departing on future initiatives. What can be learned from existing methodologies, processes, experiences and trends and how can these be relied upon as the foundation for change. The decisions made from what you have learned will have significant impact on success.

The cornerstone of Quality Decision Making within an organisation is the quality of data upon which decisions are built. Data integrity plays such a critical role in the organisation, it follows that Data Governance should hold its place as “first among equals” in the spectrum of business tools available to strategists and decision makers.

Because Data Governance matters, best results are achieved through knowledge based on the broadest possible spectrum of data – not just the bottom line produced by the ERP or other core system. In order to gain the advantage of your entire knowledge within the business it is necessary to bring together data sets from all aspects of paper, digital and activities. Data Governance emanates out of the right tools and a defined set of procedures, specifically focussed on achieving these outcomes:
– Increased confidence in decision making
– Decreased Risk
– Better planning and strategising
– Faster identification of improvement areas
– Better staff effectiveness

In other words; the ability for an organisation to Learn all there is, from end to end, elevates the success of future innovation – whatever form it may take.

Have you ever found yourself saying, ‘there has to be a better way’? We hear you.

When it comes to processing data through your organisation, are you fed up with the way things are done?  Have you been forced to follow the status quo because it is just the way things are?  Powerless and exhausted, you know there is a better way, but why doesn’t anyone do something about it?

One question we frequently ask is ‘Why would you continue to do things the same way if there’s a better option?’  After all, times change, technologies change, and the job at hand is not getting any easier.

You often hear talk of increasing efficiencies and reducing costs, but this usually masks the real intention – more work with fewer staff.  As a result, the tasks pile up with no end in sight.

Fear not, as you are not alone.  In fact, we have run into office after office of filing cabinets hiding the disorganisation.  Effectively out of sight, out of mind.

However, the impact is real.  Lost files, compromised confidential data, the loss of relevant information, and even worse, time-consuming processes and a general sense of frustration.





We don’t simply shift from manual or paper-based systems to electronic systems.  We help you to modernise processes by creating a better working environment, more efficient data systems, more effective tracking systems, and increased availability to the information you need to get the job done. It does not need to be like this. Automated solutions can transform the way business is conducted and this is where we come in.  Clients rely on Avantix to work alongside with them, to understand their current problems, long-term goals and the kind of solutions they need to implement to take their business to the next level using automation.

If you are dissatisfied with your existing processes, you have a choice; either continue to do things the old way and get the same results or make your life easier by transforming the way you work.

Our team works with clients of all sizes and in all industries throughout Australia.  We understand creating an automated system is meaningless if it does not benefit everyone – today and tomorrow. Moreover, a clearly defined ROI in under 12 months is essential.

Many of the people we have worked with initially thought their cause was hopeless and would fall neatly into the “too-hard basket”. This is a trap! As soon as our clients saw how seamlessly their goals could be met, the hesitation quickly dissipated.

Will you continue to just think there is a better way or are you willing to take action?  Once you and those around you become champions for change, anything is possible.  The first step is yours. Contact us to find out how others who were in a similar situation to you, have achieved extraordinary results.

Avantix empowers businesses to realise the benefits of digital information. Our core services reduce the time it takes to locate relevant business information, supporting better decision making. Our intelligent information processing and digital solutions transform disparate silos of information into searchable intellectual property that is securely accessible.  Contact us at for more information.