Plagiarism Just Isn’t for Everybody Stephen Burd () Presentation copies available online

advertisement
Plagiarism Just Isn’t for Everybody
Stephen Burd (burd@unm.edu)
Academic Technology Liaison
Presentation copies available online
http://averia.unm.edu
Last revised: 7/17/2016 3:38 AM
Plagiarism

Dictionary definition: the act of
 using another person's words or ideas
 without giving credit to that person

A humorous musical perspective:
 “… in one word he told secret of success in
mathematics: PLAGIARIZE, plagiarize, let no one
else’s work evade your eyes, remember why the good
Lord made your eyes, so don’t shade your eyes, but
plagiarize, plagiarize, plagiarize, … only be sure always
to please call it … RESEARCH”
 Tom Lehrer – Lobachevsky – Songs and More Songs
by Tom Lehrer – Rhino Records
Plagiarism - Perspectives and Implications

Ideas, words, images, and data are the “product”
of researchers’ labor
 Who owns them – author, funder, publisher, public
domain?

Giving credit where credit is due
 It isn’t plagiarism if you do this
 Enable readers of your work to find related work

Honesty about your contribution
 Show how your work builds on others’ work
 Compare and contrast your ideas/work to others’ work
 Definitively show your unique contribution
Editor and Funding Agency Perspectives

Is this original work?
 What’s the balance of original and un-original content?
 Has any portion been published elsewhere?
 If so, where and how much?

Is included material properly cited?
 Is everything that should be cited actually cited?
 Do the citations reference the proper sources?

Has the work been funded before?




By other funding agencies?
By other PIs doing similar things?
By the same PI(s) in the past?
If yes, are the earlier projects cited in this proposal?
What is a Plagiarism Detection Tool?

Modern plagiarism detection software/services
perform the following functions:
 Search for similar or identical text on the web, in publication
databases, and in other document repositories
 Annotate the document to identify matched text
 Enable viewing of matched text sources
 Optionally report an originality or plagiarism “score”

Best to think of the tools as:
 Match detection or originality tools
 NOT – plagiarism detection tools

The distinction implies that plagiarism is a judgment
based in part on evidence of matching and originality
Available Tools

Roll-your-own
 Anyone can search for matches using the search engine of
their choice (e.g., Google or Bing)
 Though workable, using search engines directly has
disadvantages that include:




Lots of cutting, pasting, and typing
Inability to easily get behind “pay walls”
Inefficient for large amounts of suspect material or large numbers of
documents
Turnitin/iThenticate
 The clear market leader
 Turnitin – optimized for class-related use
 iThenticate – optimized for research-oriented use
Capabilities and Limitations

The good:
 Works very well with text, including simple paraphrasing
 Searches documents in multiple languages

The bad:
 Type I (false negative) and Type II (false positive) errors
 Images and sounds aren’t matched
 Not easily applied to larger bodies of work including
document collections, web sites, and blogs

The ugly:
 As with any tool, automated plagiarism detection:
Can be well-used or misused
 Training helps to achieve desired outcomes

iThenticate at UNM

iThenticate currently licensed UNM-wide





Current license expires August 2015
Renewal currently being negotiated
Free use by UNM users
Authorized users – all faculty, staff, and students
Register for an account
 Visit ithenticate.unm.edu



Click the big red button on the right
Access training materials
Access iThenticate
 Accessed as an online application (i.e., a Web application or
software as a service)

http://www.ithenticate.com
About iThenticate Originality Reports

Originality reports contain information about
matches between submission content and various
source databases including:
 Internet-accessible content
 Publication databases and repositories

Originality reports contain a similarity index
 A percentage of matched or unoriginal content
 Lower score is better – but what’s a good score?
Sample Turnitin Originality Report
Drilling Down to Sources
Using Originality Reports - Advice

iThenticate is a tool for identifying some instances of potential
plagiarism
 The tool will generate false positives and negatives
 The author or reviewer needs to:




Examine the matches and decide whether they constitutes plagiarism
Watch for missed matches (e.g., passages that sound too sophisticated/polished,
style mismatches, …)
Determine how similarity (proper or improper) impacts the publication or funding
decision
iThenticate streamlines the process of identifying potential
plagiarism and follow-up investigation
 Similarity indices can be used to identify targets for investigation (e.g.,
above a certain percentage or the top few percentage values)
 Matches are visually summarized and paired to sources
 Source material can be viewed with a single click in most cases

Summary – the tool is simply that – a tool – not a substitute for
human judgment or follow-up effort
 “Every (or any?) tool is a weapon - if you hold it right” – Ani diFranco
Summary

Editors and funding agencies use originality
checking tools to detect plagiarism

You have access to the same tools that they use
 Free to you
 Easy use

Don’t you want to see what they’re going to see
BEFORE they see it?
Download