Thinking in Clojure
26-Jul-16

Jumping in
  We’ll quickly go through Clojure’s data types, some basic functions, and basic syntax
  Then we’ll get to the good stuff!

Clojure’s data types
  Clojure has:
    Lists, enclosed in parentheses and separated by spaces or commas: (a 17 "Plenty of parentheses")
    Functions: (fn [x] (first (rest x)))
    Numbers: all Java numeric types, plus ratios and exact decimals: 5, 5.3, 5.3e30, 077, 0xFF00FF, 3/5, 5.3M
    Strings, as in Java: "She said \"Hello\""
    Characters: \a, \5, \n, \newline, \tab, etc.
    The booleans true and false
    nil, equivalent to Java’s null
    Keywords, which stand for themselves: :meadow, :CIS-554
    Vectors: [5 :a "hi!"]
    Maps: {:one 1, :two 2}
    Sets: #{:prolog :clojure}

Some basic Clojure functions
  Syntax of a function call: (function args)
  (quote arg) or 'arg, to keep arg from being evaluated
  Basic arithmetic: (+ args), (- args), (* args), (/ args), (< args), etc.
  Basic operations on sequences (a seq may be a list, set, map, or vector):
    (first seq) is the first element in the sequence (or nil)
    (rest seq) is what remains when the first element is removed (possibly an empty sequence)
    (cons arg seq) returns a new sequence with arg as its first element
  Basic logic:
    (= args) tests whether its args are equal
    (empty? seq), (list? seq), (seq? arg), (nil?
arg) are more tests
    (and args), (or args), (not arg), (if condition result1 result2)
  Defining values:
    (def name value) defines the name to be the given value
    (defn name argv value) is shorthand for (def name (fn argv value)), where argv is a vector of parameters

Functions and special forms
  The arguments to a function are evaluated before the function is called
    Example: (* 2 (+ 3 4))
    The function * is called with the arguments 2 and 7
  A special form looks just like a function, but it gets its arguments unevaluated
    The special form itself decides when and whether to evaluate its arguments
    quote does not evaluate its argument
    if evaluates its first argument, then decides which of the second and third arguments to evaluate
  Clojure allows you to define your own forms of this kind, as macros
    This means you can define your own control structures

A typical Clojure function
  (defn first-double-letter
    "Returns the first doubled letter in a string, or nil."
    [s]
    (if (< (count s) 2)
      nil
      (if (= (first s) (second s))
        (first s)
        (first-double-letter (rest s)))))

  user=> (first-double-letter "Pennsylvania")
  \n
  user=> (first-double-letter '(1 2 3 4 3 5 5 4 6))
  5

It’s easier with cond
  cond is an if … then … else if … then … else … construct:
    (cond test1 result1 test2 result2 … testN resultN)
  It requires an even number of arguments (one result for each test)
  The keyword :else may be used as the last test

  (defn first-double-letter
    "Returns the first doubled letter in a string, or nil."
    [s]
    (cond
      (< (count s) 2) nil
      (= (first s) (second s)) (first s)
      :else (first-double-letter (rest s))))

  user=> (first-double-letter "Pennsylvania")
  \n
  user=> (first-double-letter '(1 2 3 4 3 5 5 4 6))
  5

It’s all about recursion
  Some rules of doing recursion:
    1. Handle the base cases directly (without recursion)
    2. Recur only with a simpler case
    3. Don’t use global variables
    4. Don’t “look down” into the recursion; that will just confuse you
  In Clojure you are almost always working with a list or some similar sequence
    Lisp programmers say, “Do something with the head, and recur with the tail”
    Clojure’s terms for “head” and “tail” are “first” and “rest”
    This pretty much covers rules 1 and 2 above
  Clojure code normally has no mutable global variables
    This covers rule 3 above
  Rule 4 always holds: think about what you are doing now, not what some recursive call is doing

Functional programming
  Clojure is functional. What does that mean?
  Functions are like functions in math: called with the same arguments, they always return the same result
    This means: no “global variables,” no dependence on external values, and no side effects!
  Functions are values, or first-class objects
    Functions can be passed as parameters to functions, returned as the value of functions, created as needed, and stored in data structures, and there are operations on functions that produce new functions
    The “blub paradox” applies: the value added is substantial, but not obvious to an imperative or object-oriented programmer
  Data is immutable (like strings in Java)
    Clojure’s data structures are designed to make this efficient
    Immutable data greatly simplifies concurrent programming
  Because data is immutable, loops are unnecessary (use recursion instead!)

Costs and benefits
  Costs of functional programming
    It’s weird and unfamiliar
    How can you do anything without objects, mutable variables, or loops?
      (Loops are used primarily to change the values of things)
    For a manager, functional programmers are hard to find (and expensive!)
    Clojure, and Lisp dialects generally, have too many parentheses!
  Benefits of functional programs
    Easier to write correct programs
      “Yeah, right!” – “No, really!
All data is local and immutable.”
    Easier to write unit tests, because function values depend only on inputs
    Much easier to write concurrent programs
    Operations on collections make code simpler and more concise
    The simpler foundation means less syntax and fewer special cases
    Some operations, such as equality testing, are really fast
  But it’s still weird!

Easier to write correct programs
  Programs are easier to write when all data is local
    When relevant values can be changed elsewhere in the program, possibly in many places, it’s harder to see all the connections
  Unit testing is easier, because there are no dependencies on functions that may or may not have been called previously
    Functions in a functional language get all relevant input from the parameter list
    There is no need for a setUp method
  Functional programming supports powerful operations on sequences
    The imperative and object-oriented programming styles have been characterized as “word by word” programming
    Some sequence operations, such as membership testing, are provided for you
    In a functional language, any function can be a sequence operation

The problem of state
  Nonfunctional programming languages are “stateful” or “have state”
  The state of a program is given by (1) the values of all the variables throughout the program, and (2) the current locus of execution
  That can be a huge amount of information to keep track of!
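To make “state” concrete, here is a small sketch (not from the original slides; counter, next-label!, and label are hypothetical names). The first function reads and updates an atom, one of Clojure’s explicit mutable references, so its result depends on what was called before it. The second depends only on its argument, which is why it needs no set-up to test.

```clojure
;; Illustrative only: these names are assumptions, not part of the deck.

;; Stateful: the answer depends on how many calls came earlier.
(def counter (atom 0))

(defn next-label!
  "Bumps the shared counter and returns a label built from it."
  []
  (str "item-" (swap! counter inc)))

;; Pure: the answer depends only on the argument list.
(defn label
  "Returns the label for n."
  [n]
  (str "item-" n))

(next-label!)   ; "item-1" on the first call
(next-label!)   ; "item-2" on the second call: same arguments, different result
(label 2)       ; "item-2", no matter what ran before
```

To predict what next-label! returns you have to know the whole call history; for label, the argument list is enough.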
  Object-oriented programmers try to control complexity by having objects be responsible for their own state and “loosely coupled” to (not very dependent on) other objects
  Methods often have “side effects,” that is, they modify state
  Functional languages try to avoid having state at all
    This isn’t always easy
  Purely functional languages cannot have side effects
    Since I/O is a side effect, this is an even more difficult restriction

Maintaining state, functionally
  Sometimes you just need state
  Consider an adventure game
    You need to keep track of where you are, where other objects are, what you are holding, which paths are blocked or open, etc. This is your state
    You do not need to keep track of permanent, immutable data; for example, most paths between rooms are fixed and unchanging, so they aren’t part of the state
  In a functional program, a “state” is just an immutable data item (and usually just one)
    The data item can be quite complex, such as a dictionary
    States are immutable, but you can always create a “new” state that is a variation of a given state
    With carefully designed data structures, not as much storage is required as you might expect
  So the functional solution to maintaining state is: pass one state into a function, get a new (and different) state back!

Clojure’s I/O compromise
  A purely functional program has no side effects
    I/O is a side effect
    Therefore: a purely functional program cannot do I/O!
  In Clojure, all functions return a value
    (print args) and (println args) return nil
  Clojure allows side effects in two well-defined places: do forms and function bodies
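The “pass one state in, get a new state back” idea from the adventure-game slide can be sketched as follows. This is a minimal illustration, not code from the original deck: paths, start-state, move, pick-up, and describe! are hypothetical names, and the rooms are made up. The state is a single immutable map, the fixed room layout lives outside it, and the only side effect is the println inside describe!, which returns nil.

```clojure
;; A sketch under stated assumptions; every name here is invented for illustration.

;; Permanent, immutable data: not part of the state.
(def paths
  {:hall    {:north :library}
   :library {:south :hall}})

;; The whole game state is one immutable map.
(def start-state
  {:location :hall
   :holding  #{}})

(defn move
  "Returns a new state after trying to go in direction dir.
  If there is no such path, the state comes back unchanged."
  [state dir]
  (if-let [room (get-in paths [(:location state) dir])]
    (assoc state :location room)
    state))

(defn pick-up
  "Returns a new state in which the player also holds item."
  [state item]
  (update state :holding conj item))

(defn describe!
  "Prints where the player is. Called for its side effect; returns nil."
  [state]
  (println "You are in the" (name (:location state))))

;; Each step takes a state and hands back a new one.
(def later-state
  (-> start-state
      (move :north)
      (pick-up :lamp)))

(describe! later-state)   ; prints: You are in the library
```

After these calls start-state still says :hall with nothing held; later-state is a separate value.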
    (do args) evaluates all its arguments in order, but returns only the value of the last one
    When a function (fn argv exprs) is called, the expressions in its body are evaluated in order, but only the last value is returned
  Example:
    (defn powers
      "Prints the cube and square of a number read from input; returns the number."
      []
      (let [n (read)]          ; let keeps n local; its body is also evaluated in order
        (println (* n n n))
        (println (* n n))
        n))

Lists are immutable
  Here is a typical list, my-list: (A B C)
  Here is (cons 'w my-list): (w A B C)
  Here is (rest my-list): (B C)
  [Diagrams of the list cells omitted]
  Notice that my-list remains unchanged
  Vectors, hash maps, sorted maps, hash sets, and sorted sets are similarly immutable

Functions are just values
  user=> (cons 'w '(a b c))
  (w a b c)
  user=> (defn swap-args [f x y] (f y x))
  #'user/swap-args
  user=> (swap-args cons '(a b c) 'w)
  (w a b c)
  user=> (defn apply-n-times
           "Apply f to x, n times: f(f(f..(x)...))"
           [f x n]
           (if (zero? n)
             x
             (apply-n-times f (f x) (dec n))))
  #'user/apply-n-times
  user=> (apply-n-times (fn [x] (* 2 x)) 1 10)
  1024

Collatz, the hard way
  Definition:
    collatz(1) = 1
    collatz(n) = collatz(n / 2) if n is even
    collatz(n) = collatz(3 * n + 1) if n is odd
  user=> (defn collatz [n]
           (let [do-even (fn [n] (collatz (/ n 2)))
                 do-odd  (fn [n] (collatz (inc (* 3 n))))]
             (print n " ")
             (if (= n 1)
               1
               (if (even? n)
                 (do-even n)
                 (do-odd n)))))
  #'user/collatz
  user=> (collatz 7)
  7 22 11 34 17 52 26 13 40 20 10 5 16 8 4 2 1 1

map, filter, and reduce
  Here are three powerful functions you will find in almost any functional programming language
  map – apply a function to every element of a sequence, returning a sequence of results
    user=> (map even? '(3 1 4 1 6))
    (false false true false true)
  filter – apply a predicate to every element of a sequence, returning a sequence of those that satisfy the predicate
    user=> (filter even?
'(3 1 4 1 6))
    (4 6)
  reduce – use a function to reduce a sequence to a single value
    user=> (reduce * '(3 1 4 1 6))
    72

The real problem with state
  For decades we’ve been dealing with mutable state
  Mutable state + concurrency = nondeterminism
    We use threads and locks and semaphores and so on
    These are complicated, unsafe, and inefficient
  As Herb Sutter points out in The Free Lunch Is Over, we have hit a 3 GHz barrier
    Since 2003, individual processors have not gotten much faster
    We still want them faster
    Concurrency is the only solution
  Functional languages, with immutable state, provide a partial solution
  As Martin Odersky points out, you can hide from concurrency for a while yet…but not forever
  Important consequences:
    All newer languages are gaining functional and concurrent features
    Older languages, such as Java, are also trying to integrate these features
    “You can run, but you can’t hide!”

Oh, and by the way…
  Clojure has
    Infinite sequences
    Lazy sequences
    Exact decimal arithmetic
    Function composition
    Function currying
    Macros
    And lots more
  No version of Lisp has ever become mainstream
    They just get mugged in dark alleys and have their ideas stolen!

The End
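Several of the features listed on the “Oh, and by the way…” slide can be tried at the REPL in a few lines. The sketch below is not from the original deck, and powers-of-two, inc-then-double, and add-ten are hypothetical names. It uses only clojure.core functions (iterate, take, comp, partial). Note one assumption about terminology: Clojure does not curry functions automatically, so partial is shown here as the usual stand-in for currying.

```clojure
;; Illustrative sketch; the names defined here are not part of the slides.

;; Infinite, lazy sequence: powers of two, computed only as far as needed.
(def powers-of-two (iterate #(* 2 %) 1))
(take 5 powers-of-two)             ; (1 2 4 8 16)

;; Exact decimal arithmetic with BigDecimal literals (the M suffix).
(+ 0.1M 0.2M)                      ; 0.3M

;; Ratios stay exact too.
(+ 1/3 1/6)                        ; 1/2

;; Function composition: comp applies the rightmost function first.
(def inc-then-double (comp #(* 2 %) inc))
(inc-then-double 5)                ; 12

;; Partial application, standing in for currying.
(def add-ten (partial + 10))
(add-ten 5)                        ; 15
```

Because powers-of-two is lazy, defining it costs nothing; only the five elements that take asks for are ever computed.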