Acrobat OCR - Parima Sahbai

Adobe

Introducing text recognition (OCR) to Acrobat mobile

Modernizing the PDF

Adobe Acrobat is well-known for popularizing PDFs as our digital counterparts to paper documents. With the popularity of mobile platforms, it is increasingly important to offer on-par capabilities with desktop applications right on your phone. An important part in modernizing Acrobat and PDFs is bringing Optical Text Recognition (OCR) to mobile platforms. With this feature, struggling to copy text and edit from an image is a problem of the past.

Company: Adobe
Duration: 1.5 years
Tools: Adobe XD, Figma
Platform: iOS and Android mobile devices
Role: Wireframing, iteration, prototyping

The Product

Adobe Acrobat offers multiple avenues of working on-the-go. Coming out in 2011 and 2012 respectively, you can download and use Acrobat on both iOS and Android platforms. During the past decade, the apps have ramped up competatively to offer the same great desktop features on mobile. The Acrobat mobile team at Adobe is directly responsible for the following products that I had the pleasure to work on:

Adobe Acrobat Mobile on iOS (including iPad)

Adobe Acrobat Mobile on Android (including tablets)

Adobe Scan on iOS

Adobe Scan on Android

Responsible for both free and paid user flows

My Role

During my time on this project, I was directly responsible for owning and developing this feature from start to end on both iOS and Android platforms. My responsibilities included:

High/low fidelity UI work
Prototyping
Competitive Analysis Research
Close collaboration with PMs and developers to ensure technical feasibility and correct implementation
Post build QA walk throughs and inspection
Project and timeline management

Journey of the product: from competitive analysis, to journeymapping/bluelines and final designs

Problem | Research

Scanned PDFs are essentially images, in PDF format. They have many limitations which do not allow the user to make changes to the document or even perform operations such as copy or highlight text. 17%-18% of all files opened in-app are scanned files which means a large chunk of our users are missing key functionalities that will enable them to successfully interact with their documents.

Working with PMs and developers

One of the most challenging aspects of this project was the complicated and restrictive technical limitations that popped up during the entire duration of the project. By utilizing my background (BA and MCS) in computer science I was able to successfully navigate the complicated technical nature of the project and work closely with developers to ensure that we strike a balance between what is possible, and what is ideal for the user resourcefully.

Research, understanding, and Explorations

blue lines, why we're doing it ppt, what it is like in the backend, apple vision explorations

Searchable OCR: Allows for quick processing by laying an invisible text layer over the original document. Users can now highlight, comment, and copy text.
Editbale OCR: The document will be reconstructed, effectively allowing users to edit and modify text and images like any other PDF
Scanned PDF: A PDF that is essentially an image and has no text data/strings that a PDF reader would be able to detect.

Dividing the project into phases | MVP and beyond

This large feature roll-out required design to understand and scope work to come up with an MVP which could scale to allow full features within the span of the project timeline. The phases were divided into the following:

Phase 1 (iOS rollout)

Features:

Copy
Highlight
Underline
Search

Improvements:

Faster processing (compared to editable OCR, coming up in the next phase)
Original document left intact

Restrictions:

Unable to automatically run OCR on document open due to technical restrictions
No text editing
Limited number of languages supported

Phase 2 (iOS rollout)

Features:

Text editing

Restrictions:

Long processing time: the longer the document, the slower experience, worst case is 17 seconds for one page
Reconstruction of original document == unpredictable irreversable results due to autosave on mobile

Phase 3 (iOS rollout)

Features:

Expanded supported languages

Restrictions:

Large datapacks for languages
Manual downloads needed for certain langauges

For the remainder of the project, I went on to work on Android version with the Android engineering team which followed the same release pattern but had it's own designs and challenges.

Post release

As of December 2023, OCR was released to all users on all acrobat platforms (iOS, Android, iPad, tablet) and has been well recieved in the app store. In addition to good reception from the general public, the feature brought generated a total GNARR of over $400,000 from just subscriptions on Android and $235,000 on iOS respectively over a year.

The future and scalability

While designing this feature, I was extremely concerned with scalability and worked closely with developers to ensure that sometime in the future we can enable automatic language detection and run OCR automatically without blocking the user. The goal is for the user to not even notice the difference between a scanned image PDF and a regular text enabled PDF.

Takeaways

Gaining a deep understanding of the technology behind your designs is extremely important, having a background in computer science allows me to have deep technical discussions with developers and expedite and gain the trust of stakeholders much faster allowing me to create intentional and feasible designs that can be translated into code.