How to write a (hopefully good) paper by Martin Vetterli 1. Introduction 2. To write well..... 3. The structure of the paper 4. Figures and experimental results 5. The talk and the paper 6. Open access, open data, reproducibility 7. Conclusions and outlook 1 Acknowledgements Those from whom I learned.... • teachers • co-authors • J. Kovacevic • V. Goyal • former students • students A. Ortega, USC “Writing a technical paper: A few random thoughts on making life easier for the reader (and your advisor!)” M. Püschel, J. Kovacevic, “How to write a paper” M. Püschel, “Small guide to giving presentations” 2 To get started... Do you want to know what knowledge is? • When you know something, recognize that you know it, • and when you don’t know something, recognize that you don’t know it. That is knowledge. The Analects of Confucius, Book 2 Chapter 17 So: • • • • • writing is about transmission of knowledge there is a ‘’channel’’ between you and the reader Try to maximize capacity! it is a multiuser channel (you compete for reading time...) the reader, by definition, is never wrong! (same goes for reviewers and editors...) • there is a very slow/long feedback loop (e.g. your career…) 3 Writing is also about understanding! Many questions appear once you write your results • What you thought was clear isn’t once you try to explain it on paper • New interesting questions emerge • Leads to more research • Maybe initial result was not so interesting after all… So: • Keep notes • Write about your progress • This is not a paper yet, but it shapes and sharpens your thinking 4 P.Halmos Halmos’ view The basic problem in writing mathematics is the same as writing in biology, writing a novel, or writing directions for assembling a harpsichord: the problem is to communicate an idea. To 1. 2. 3. 4. 5. do so, and to do it clearly, you must have something to say, have someone to say it to, organize what you want to say, arrange it in order you want to say it write it, rewrite it, re-rewrite it, and rerewrite several times, 6. be willing to think hard about and 7. work hard on mechanical details such as diction, notation, and punctuation. That’s all there is to it. P. Halmos, “How to write mathematics” 5 Bertsekas Bertsekas’ view Mathematical writing is the type of writing where mathematics is used as a primary means for expression, deduction, or problem solving. It is fundamentally different from creative and expository writing for two main reasons: 1. It involves the interplay of two languages (natural and math), 2. It requires much slower reading (it expresses complex ideas that must often be read and several times) As a result, many of the rules and suggestions found in writing style manuals are inadequate and/or dot not apply. We propose an approach to mathematical writing based on a set of simple composition rules. 1. Organize in segments, 2. Write segments linearly 3. Consider a hierarchical development 4. Use consistent notation and nomenclature 5. State results consistently 6. Don’t under explain, don’t over explain 7. Tell them what you will tell them, 8. Use suggestive references 9. Consider examples and counterexamples 10. Use visualization when possible D.Bertzekas, TEN SIMPLE RULES FOR MATHEMATICAL WRITING 6 The basic assumptions... What are we trying to accomplish? • you have some worthwhile research results • they are solving a real problem (open problem, new problem) • you are ahead of the crowd You have complete results (...) • no holes as far as you can see • a complete picture • a coherent picture You are willing to communicate your results • no killer patent killed • you feel ready to confront the world (that is, 2.35 reviewers of some Transaction) Full disclosure always pays.... • nothing under the carpet, please Note: • only latex spoken here 7 To write well..... read, read and read! The classics: • any good book is a good start (my favorite is J.L.Borges. e.g. Fictiones) The scientific classics • C.E.Shannon, A Mathematical Theory of Communication, Bell Syst. Tech. Journal, 1948 (do reread this on a regular basis) • I. Daubechies (yes, yes, the 100p. paper) The great authors (around our topics) • R.Gallager • G. Strang • S.Mallat Note to all.... • there has to be a reading culture • I know this is ‘’Playstation generation’’ but for this job, people have to devour the Transactions and arXiv • there has to be a library culture (go find that obscure paper/book) • there has to be a book culture (what book have you bought/read lately?) • (do not read too much on a particular topic before starting research, it can be demotivating.... optimal # of papers to read!) 8 My approach on reading versus research Reading Research 9 To write well.....write, write and write! Writing is a painful process: • I still write on paper, do many iterations, cut-and-paste, drafts, etc. • so do many people... • it takes a lot of time Writing is an iterative process • the spiral method of Halmos (1, 1&2, 1&2&3, ...) • write, rewrite, re-rewrite (and not cut-and-paste!) • let it sit for a while • have other people read it (inc. boy/girlfriend!) • read aloud • make short sentences (many times I have seen “this phrase no verb”...) • do get started (e.g. Camus, “The Pest”) You should be the most critical reader • otherwise, somebody else will.... Is the hardest paper the best paper? • who are you trying to impress ;) • people often spend most space on what took most time... 10 Real estate is at premium Space in paper natural adapted Research effort 11 The tools of the trade The books: • E.B. White, Elements of Style • N.J. Higham, Handbook of writing for the mathematical sciences • P. Halmos, How to write mathematics. • Edward R. Tufte , The Visual Display of Quantitative Information The journals: • IEEE Tr on SP, IP, SAP • IEEE Tr on IT, ToN, Comm, JSAC • the IEEE magazines (SP mag etc) • SIAM Review • Nature, Science, PNAS Some of the data bases: • http://apps.webofknowledge.com: The (in) famous web of science • http://ieeexplore.org/lpdocs/epic03: all of IEEE on line • http://scholar.google.ch/ • http://www.researchgate.net/ 12 Size matters The IEEE societies (IEEE 300K) • IEEE Computer • • • • 120K IEEE Communications IEEE Circuit and Systems IEEE Signal Processing IEEE Information Theory The journals (circulation, impact factor) • Signal Processing Magazine • IEEE Tr. on SP • IEEE Tr. On IP • IEEE Tr. on IT • IEEE Tr. A&S • IEEE SP Letters • IEEE Tr. On Multimedia 75K 25K 20K 6K 16K, 4.9 5K, 3.2 3K, 3.1 4K, 2.6 2K, 2.6 1.5K, 1.6 1K, 1.7 Of course, the question is who reads them… • Impact factors of high visibility journals in an order of magnitude bigger 13 Impact factor An example (just for fun) • Laemmli UK, Cleavage of structural proteins during assembly of head of bacteriophage T4, Nature 227 (5259): 680: 1970. Times Cited : 214’799 ! • Now the man says (Interview, NZZ Folio, Nov. 2005) Q: Would you recommend to a young researcher to develop a method so as to get cited of the? A: No, I would say: Be creative and take risk. Try something new. What is important in science is to ask the right question. And if to answer the right question, you need a method, then develop it. Another example… Grigori Perelman 14 The tyranny of the impact factor... Definitions: • Web of science: a database of papers and citations from other papers, mostly journal papers. • Impact Factor: average number of citations a paper gets in the first 2 years after publication in a given journal • Note: Nature ~ 32, IEEE Tr. on IT ~ 2,5) • H-index: max. number N of papers cited more than N times (example : E. Witten-120! See Hirsch’s paper for details) • Etc. etc. Then the game can start...... • http://apps.webofknowledge.com • http://www.cs.ucla.edu/~palsberg/hnumber.htmlhttp://scholar.google.com • Bottom line: better work in life sciences ;-) • “Not everything that can be counted counts and not everything that counts can be counted” (A.Einstein) 15 IF: “Houston, we have a problem” 16 Statistics on what parts of a paper are read... title & abstract introduction bio.!!! section 1 lit. other sections So do proper waterpouring ! conc. 17 The various reading levels of a paper The title should be catchy, or self-explanatory • Costa: Writing on dirty paper, • Shannon, A Mathematical Theory of Communication • Gallager, Variations on a theme by Huffman • Dubois-Ferrière, Age matters: Efficient route discovery … • M.Kac, Can one hear the shape of a drum Or: no second chance to make a first impression! The abstract is the sales pitch for the paper • why would anybody want to read this paper • it has to pose the problem explicitly, and indicate clearly what is accomplished • Beware of acronyms The Table of Content (ToC) should a l low to survey the pa per • sections have to make sense, with headings that do too • flow, sizes The figures should be self-contained • browsing the p a per through figures only • caption self-contained (be able to read the figure without the text) • text and caption complementary 18 The classic structure of the paper (1) Title and abstract • • • • be careful with affiliations (e.g. EPFL and not Swiss Federal Lausanne Institute) be careful with acks, e.g. to funding agencies (ERC, NCCR, SNF) the author order headache... In doubt, αβtical Different fields have different cultures 1. Introduction and outline • why shall I (the reader ) spend N hours on this • motivation for the problem (why is this important...sorry, it might not be) • it is either the proof of Fermat’s last theorem (no further motivation needed) or you need to properly pose the problem 2. Related work • give credit where credit is due • “good manners” in referencing (you know when you see it) • make sure you set the stage for indicating why what you present is new, better, cheaper, glitzier, cuter... 3. Contributions • You probably know what you have contributed, but no one else does…. 19 The classic structure of the paper (1) Title and abstract: An example Scrutiny of the abstract By Kenneth K. Landes Abstract • The behavior of editors is discussed. What should be covered by an abstract is considered. The importance of the abstract is described. Dictionary definitions of “abstract” are quoted. At the conclusion a revised abstract is presented. Abstract (bis) • The abstract is of utmost importance for it is read by 10 to 500 times more people than hear or read the entire article. It should not be a mere recital of the subjects covered, replete with such expressions as “is discussed” and “is described”. It should be a condensation and concentration of the essential qualities of the paper. 20 The classic structure of the paper (2) Introduction: An example Scrutiny of the introduction Jon Claerbout Abstract: • The introduction to a technical paper should be an invitation to readers to invest their time reading it. • Typically this invitation has three parts 1. The review 2. The claim, and 3. The agenda. • In the claim the author should say why the paper’s agenda is a worthwhile extension of its historical review. • Personal pronouns should be used in the claim and anywhere else the author expresses judgment, opinion, or choice. 21 The classic structure of the paper (3) 4. The meat structure the development carefully make adequate sectioning/subsectioning decide on Lemmas, Propositions, and Theorem(s) (the “1 Thm/paper algorithm”) put details in appendices (for ex., for each Thm, decide if proof is in appendix) theory is never made too easy think of examples, inc. toy examples, figures, diagrams, illustrations, tables 5. The experimental section describe the experimental set up precisely the results should be reproducible the data should be available the presentation of the results is key (see later) 6. Conclusion(s), outlook, further work don’t take the reader for a ride (e.g. Fermat again) 7. Appendices: can be most helpful! 8. Literature: careful please 22 The classic structure of the paper (4) So write an outline first! • structure of thoughts • what are the main ideas you want to get across • make it detailed enough • is the flow adequate (not a random juxtaposition...) • these are not lab notes, chronological, etc. The outline will change • a manuscript is a living animal • it will bite back • it will give you nightmares The skeleton of the paper is • the motivation, problem setting • the “main” theorem(s) • the lemmas and propositions that allow it • the examples that highlight how it all works • the experiments that justify it all 23 Presenting an idea The logic should be clear to anybody (not just you) • logical progression • idea 1 -> idea 2, etc . Be clear: • there are 2 reasons why XYZ is not used in practice. (i) it is not robust in case of. .. (ii) it is absurdly complex for ... Do not let the reader guess what you solved, and what not • this is clear in the math mode, but the same is true in the experimental mode as well • the ‘iff ”. The converses. Strengthening the results. Repeat NO, develop YES • multiresolution approach can be best • like in a good plot of a novel, hints can lead the reader • wet the appetite, give a main course, highlight with dessert 24 How to get ideas and results across (1) Be explicit • put examples (toy examples, real examples) • put tables with usable results (the famous “Daubechies’ filter tables) • spell algorithms out • put matlab/python code in paper or on line Make life easy for the reader • the reader is just as lazy as the writer • it can be shown... show it But don’t be boring! • too explicit can be boring Always ask the dual question also • there might be new research right there! • Or the next paper ;) 25 How to get ideas and results across (2) Make it easy for the user… he/she will use your results! 26 On the most misused word(s) in the literature Optimal • if all the claimed “optimality” were true... we could all retire! Complexity (computational) • what is complex, what measure, O(.), constants, etc. It is easy to see/verify • probably it is not, otherwise it would be written • it can be shown... show it (Fermat again) • it is left as an exercise... probably you can come up with new results As can be clearly seen in the figure... • by the time it is printed, most ‘’obvious’’ differences are washed out • blow up the point 27 On theorems... Halmos’ view • present statement first • statement should be short • assumptions thus provided • no “associated results” in statement • proof follows In engineering • often along the way: “blabla. .. Thus we have proved: Thm 1” • not so nice... I am not ideological about it.... • what flows best is best 28 Presenting graphical information (1) The classic books by E.R.Tufte • The Visual Display of Quantitative Information is about pictures of numbers, how to depict data and enforce statistical honesty • Envisioning Information is about pictures of nouns (maps , aerial photos are about nouns in space, for ex). It is also about visual strategies and colors • Visual Explanations is about pictures of verbs, the representation of mechanism, processes, dynamics, causes and effects (inc. magic tricks!) So, making figures is an art! • takes a lot of effort (that is why most people skip it!) • gets you a lot of mileage (that is what most people forget...) Note: caption format is • Figure X: Block diagram of MP3. (a) Encoder. (b) Decoder. • That is: a main caption and subcaptions C.J.Minard 29 Presenting graphical information (2) from a start-up I know… 30 Presenting graphical information (3) Ten Simple Rules for Better Figures (PLOS, 11.9.14) 1. Know your audience 2. Identify your message 3. Adapt the figure to the support medium 4. Captions are not optional 5. Do not trust defaults 6. Use color effectively 7. Do not mislead the reader 8. Avoid chart junk 9. Message trumps beauty 10. Get the right tool 31 Presenting experimental results (1) Explicit experimental conditions • one realization (the Lena syndrome) • what method on what data set • was there training, was the data outside the training set (no joke!) • apples and oranges? Often, lousy statistics... • confidence intervals • statistical tests • comparisons to bounds (Remember Cramer-Rao) More • • • data better than less... at least if presented well data must be analyzed and interpreted avoid the boring tables …. If you gained insights, so should the reader • the experiments should make a point • make sure the point is not lost (e.g. prove/disprove a model) 32 Presenting experimental results (2) Typical scenarios 33 The (nasty) details (1) Notations • it is like style • it can be a headache • usually, conform to the norm • think about the alphabet, but think about it first • starting with a bad notation will bog you down, sooner or later • simplify, simplify, but not too much • Nk Θk • avoid pedantry, unnecessary generality, etc . • be rigorous 34 The (nasty) details (2) English • probably the easiest to fix (take courses) • but be careful, it is the most obviously annoying thing • “on the other hand”: a poor orphan? • we have found -> we found • in this paper we have found -> we found that • etc. • Cutting, cutting, cutting…. • Make a simple as possible, but not more • punctuation: spaces, no spaces, lower cases, vs; vs etc.... • L.Truss ‘’Eat, shoots, leaves’’: • the title of the book is an amphibology 35 The (nasty) details (3) Affiliations • Yes, it is EPFL (not Université Polytechnique Fédérale de Lausanne) from a famous CV Acknowledgments • Who paid for gets an ack (e.g.: ERC Advanced Investigators Grant: Sparse Sampling: Theory, Algorithms and Applications – SPARSAM – no 247006) The size problem • our overlength page charges seems to overtake the travel budget! • 8 page limit for Tr SP, IP, etc. • hard to predict, but please be careful • (I don’t like the Part I and II slicing either...) even though it works well for some colleague (double citation count...) 36 The (nasty) details (4) Make short sentences, because, unless you are Proust.... « Mais au lieu de la simplicité, c'est le faste que je mettais au plus haut rang, si, après que j'avais forcé Françoise, qui n'en pouvait plus et disait que les jambes " lui rentraient ", à faire les cent pas pendant une heure, je voyais enfin, débouchant de l'allée qui vient de la Porte Dauphine - image pour moi d'un prestige royal, d'une arrivée souveraine telle qu'aucune reine véritable n'a pu m'en donner l'impression dans la suite, parce que j'avais de leur pouvoir une notion moins vague et plus expérimentale, - emportée par le vol de deux chevaux ardents, minces et contournés comme on en voit dans les dessins de Constantin Guys, portant établi sur son siège un énorme cocher fourré comme un cosaque, à côté d'un petit groom rappelant le « tigre « de » feu Baudenord », je voyais - ou plutôt je sentais imprimer sa forme dans mon coeur par une nette et épuisante blessure - une incomparable victoria, à dessein un peu haute et laissant passer à travers son luxe " dernier cri « des allusions aux formes anciennes, au fond de laquelle reposait avec abandon Mme Swann, ses cheveux maintenant blonds avec une seule mèche grise ceints d'un mince bandeau de fleurs, le plus souvent des violettes, d'où descendaient de longs voiles, à la main une ombrelle mauve, aux lèvres un sourire ambigu où je ne voyais que la bienveillance d'une Majesté et où il y avait surtout la provocation de la cocotte, et qu'elle inclinait avec douceur sur les personnes qui la saluaient. » 243 words! 37 Conclusions and further work Conclusion (if at all ;) • DO NOT TAKE THE ABSTRACT and put it in the past tense. • Laziness always shows • Make a wrap up, people will look for your assessment of the work • Contributions can be emphasized and placed in context Future work • Put ideas you think are worthwhile to pursue • Not necessarily ideas you are pursuing already… • Invite others to pursue your path (and get these citations ;) • Do not mislead (Fermat ;) Literature • Put the earliest references, even if they are from 1927 • Put all necessary references • It is nice to get cited, so please be nice in citing ;) 38 Checklist (A.O.) Can a reader with the right background: • • • • • • • • get the basic ideas understand the paper remember what is new in this work follow the proofs replicate the experiments find all assumptions in the text be convinced that this is useful not fall asleep ;) 39 On reproducible research Clairbout’s initiative at Stanford • geophysics • lots of data , code, etc. Donoho’s wavelab, etc. • lot of mileage Examples • SPIHT, wavelab, safecast, sensorscope We should, collectively, make much more! A paper should be • a manuscript (eventually a publication) • a set of documented code (matlab, C, libraries, etc.) • all data that was used • a web document Lab initiative: blue print for what we want to accomplish (A.H.Salavati’s talk) and quality control 40 The art of the quote Finding good quotes is a challenge, here a few On why we do research • I don't want to achieve immortality through my work... I want to achieve it through not dying (Woody Allen) On writing • Most rock journalism is people who can’t write, interviewing people who can’t talk, for people who can’t read (Frank Zappa) 41 How to write? Different people have different habits Victor Hugo • Standing • Eating oranges without peeling them Walking around • It seems it makes us creative 42 Conclusions Writing well is a hard task • we are all students of the art • no easy short-cuts (it will show...) • no pain, no gain... But it is a central task! • you can prove the hardest results, if nobody reads it, it was futile a endeavor • like teaching, one learns by writing • writing things down leads to new in sights , better ways to understand the problem, new research, etc. Your papers are your thesis! • take these 3 or 4 journal papers, and staple them ;) • for good measure, add a good introduction and conclusion • you are done (3 months rather than 6 or 9!) I am looking forward to reading your next paper! 43 How not to do it... Cogno-Intellectualism, Rhetorical logic, and the Crase-Trump Theorem Michael H. F. Wilkinson Institute for Mathematics and Computing Science University of Groningen Abstract- This paper presents a breakthrough in rhetorical logic, a promising field of science, of great value to those writing research proposals. It provides new, and utterly convincing tools for closing embarrassing gaps in your reasoning, without resorting to brute-force methods, such as actually thinking about the problem in the first place. The Craske-Trump Theorem, along with the Trump-Craske Conjecture will allow researchers in any field to use the technique of “Proof by Intimidation” fully. From the Annals of Improbable Research. 44 In preparation: How to do (hopefully) interesting research The most exciting phrase to hear in science, the one that heralds new discoveries, is not 'Eureka!' but 'That's funny...’ Isaac Asimov 45 Coming attraction: The art of talk.... Power point is evil, Power Corrupts. PowerPoint Corrupts Absolutely. by Edward Tufte, Wired Magazine, 2003 I am looking forward to the discussion! 46 Coming attraction: The art of talk.... From: Power point is evil, Power Corrupts. PowerPoint Corrupts Absolutely. 47 References 1. How to Write Mathematics, P.R. Halmos, L'Enseignement Mathematique, t. XVI, fasc. 2, http://www.math.uga.edu/~azoff/courses/halmos.pdf 2. Ten Simple Rules For Mathematical Writing, Dimitri Bertsekas, M.I.T. , APRIL 2002, http://www.mit.edu/~dimitrib/Ten_Rules.pdf 3. Can We Make Mathematics Intelligible? R. P. Boas, American Mathematical Monthly, Vol. 88, No. 10 (Dec., 1981), pp. 727–731. 4. The Science of Scientific Writing, George D. Gopen and Judith A. Swan, American Scientist, Volume 78. 5. Scrutiny of the abstract, Landes, Kenneth K., 1966 http://sepwww.stanford.edu/sep/prof/abscrut.html Scrutiny of the introduction, Jon Claerbout, 1995., http://sepwww.stanford.edu/sep/prof/Intro.html F.Jabr, Why walking helps us think, The New Yorker, Sept 3 2014http://www.newyorker.com/tech/elements/walking-helps-us-think?src=mp Sylvia Nasar, David Grube, Manifold Destiny, A legendary problem and the battle over who solved it. New Yorker, August 28, 2006. http://www.newyorker.com/magazine/2006/08/28/manifold-destiny Rougier NP, Droettboom M, Bourne PE (2014) Ten Simple Rules for Better Figures. PLoS Comput Biol 10(9): e1003833. doi:10.1371/journal.pcbi.1003833 6. 7. 8. 9. 48