Uploaded by Ed Cee

Revised Week 1 Update

advertisement
Week 1 Update
The Data Capturing System will be adopting the ALCOAC-CEA Principles, which are
discussed as follows:
Attributable – Who collected the data, performed a test, completed the documentation or
edited the document and when.
Legible – The documentation and all subsequent changes should be permanent and easy
to read.
Contemporaneous – Documentation should be completed and dated in real time.
Original – Study documents should be the originals, or it should be noted where the
originals are located.
Accurate – Documentation should be comprehensive and truthfully reflect what was
observed.
Complete – Maintenance of complete and up-to-date study documents.
Consistent – The data’s sequence of events is in the expected sequence of operations and
date and time stamped.
Enduring – Paper and electronic data are appropriately recorded in company books,
spreadsheets, databases etc.
Available – Paper and electronic data are required to be readily available for review, audits,
or inspections.
While the following are the steps that the researcher will go through towards
finishing the said data capturing system:
● Development
● Testing
● Training
The system aims to maximize:
● Remote Data Capture
● Automation
While it aims to minimize:
● Risk of Double Entries
● Faulty Audit Trail
● Errors in Company Reports
For us to make sure that we develop an efficient and convenient Data Capturing
System, on our side as the developer of this project, we will be utilizing the use of
Automatic Identification and Data Capture (AIDC). It can be noted that AIDC is to identify,
verify, record, communicate and store information on discrete, packaged or containerized
items. AIDC is rapidly becoming an integral and necessary component in any organization.
These technologies are capable of performing automatic data capture. Modern technology
allows data capture to be quick, accurate and reliable. AIDC is a technology driven solution
to document processing. Prior existing technology was unable to accurately process forms
such as invoices because of the various fields that such documents contain. The data is
then saved electronically for access at a later point. This will make document work
efficient and convenient.
There are many different kinds of AIDC softwares but for this specific system
development, we as the developer, chose to use Optical Character Recognition (OCR)
because it has a recorded accuracy rate of 99.95%. It can be noted that To convert a
document image to electronic text, OCR is suitable software to bring up the TIFF image of
the scanned document, select the necessary text portion and put it into a format where we
can edit the text for accuracy and usability.
OCR converts scanned or photographed images of machine-printed characters into
electronic data. OCR is most extensively used in business to capture high-volume
documents, which delivers the best ROI. OCR technology has several solutions depending
on the size, volume, and content of the paper, and "template recognition" is a common OCR
technique in item processing. Template recognition requires a "template" or "mask" for
each character. An analog or digital representation of a character is compared to each mask
in the system's lexicon. When a printed or handwritten page is scanned with an OCR
application, the document saves as a bit-mapped TIF file. After scanning, an image appears
on screen. Where computer texts are the same. Also, it checks every image line to see if the
dots match a number or letter. OCR software automatically detects and recognizes text from
scanned documents, allowing you to copy, extract, search, and change the contents.
OCR reads text from scanned PDFs (including multi page files), photos, and digital
camera images. OCR converts archival TIFF photos into editable text. OCR does not affect
picture files; it reads text from photos and creates text files for long-term storage and online
document marking. Once OCR documents, proofread. Proofreading checks each letter,
number, word, punctuation mark, symbol, etc. to ensure they match the printed content.
Checking each paragraph, sentence, word, and punctuation mark for damaging content
takes patience and concentration.
At the end or during the completion of this system, it is the hope of the developer to be able
to provide a Data Capturing System that will bring the following to its users:
● Remote and Faster Access
● Ease in Gathering Information from Files
● Save times
● Save cost
● Quick Retrieval
What to expect from the next coming weeks:
● August 28 - presentation and discussion of the 25% of the ongoing system
development
● September 4 - presentation and discussion of the 60% of the ongoing system
development
● September 11 - presentation and discussion of the 80% of the ongoing system
development
● September 18 - presentation and discussion of the 100% of the ongoing system
development
● ABBYY FLEXICAPTURE will be used as the OCR
FlexiCapture is used to analyze and automatically categorize and recognize (OCR) invoices
of varying formats, automatically extracting data needed for entry such as the business
partner’s name, the invoice number, and the billing date. All a human operator does is
confirm the OCR results, then RPA software automatically inputs them into the company
database.
1. Acquiring images
There are several ways to feed invoice images to the program:
● Scanning hardcopy invoices
● Downloading invoice images from an e-mail inbox
● Downloading invoice images from a folder on a computer or an FTP server
2. Recognition and rule checks
After input, invoices are automatically recognized by the program. Data from the invoices is
then checked using rules and a status is assigned to the invoice based on the results of these
checks. Invoices that do not require any editing receive the Valid status and are ready for
export.
3. Verification and correction of errors
Invoices with a status other than Valid need to be reviewed by an Operator. Operators check
these documents and correct errors and warnings that were issued by checking rules.
When all defects have been corrected, an invoice receives the Valid status. Operators can
also confirm statuses that were assigned automatically. The status of an invoice can be
changed manually by an Operator, who has to specify the reason for the status change.
If the program systematically fails to locate fields on invoices, it can be trained by Operators
during the document capture process.
4. Data export
Data export is the final processing stage, during which data is exported as specified in the
program's settings. Documents with different statuses can be exported in different ways,
e.g. documents with the Valid and Exception statuses may be exported to a different folder
than documents with the Rejected status. Several export destinations may be set up, and
invoices can be exported to each of these destinations.
If the status of an invoice was changed manually, the reason for the status change that was
provided by the Operator is exported as well.
The program supports export to data and image files, ODBC-compatible databases, and
Microsoft SharePoint. Its easy-to-use API enables integration with any system.
Steps Undertaken:
1. In the photo below, it can be seen that the application for flexicapture has been
installed in the test mobile device. Said application should be launched in the said
mobile.
2. Once the application was already launched, we created a sample account for the
duration of this project. It can also be seen that said sample account should be
logged in in the SETTINGS page. See photo below:
3. The log-in page should look like this and for us to proceed, we inputted the created
sample account.
4. After logging in, we proceeded in choosing the kind of project that will be used for
this project. Please see photo below:
5. From the parameters shown, the “mobile” option was selected for this specific
project.
6. After such, the sample account was already established and ready for customization
to incorporate other functions that can be attributed to the needs of a business.
On the next update on August 28, we will be able to add:
-
Creation of a database
-
Customization and creation of homepage (log-in page)
-
Trial scanning
Download