Clojure “Lisp Reloaded” Versions of LISP Lisp is an old language with many variants LISP is an acronym for List Processing language Lisp is alive and well today Most modern versions are based on Common Lisp Scheme is one of the major variants Racket is the latest version of Scheme Clojure is the latest in a long line of dialects Clojure uses a new approach to concurrency, Software Transactional Memory (STM) 2 Lisp syntax vs. Clojure syntax 3 Basic data types I Numbers: Clojure has integers, floating point numbers, and ratios Strings are enclosed in double quotes, and can contain escaped characters Within a string, \n represents a newline, as usual Character literals are written as \c to indicate the character c Integers include decimal integers (255), octals numbers (012), hexadecimal numbers (0xff) and radix numbers (2r1111), where the radix is an integer between 2 and 36, inclusive Floating point numbers include standard notation (3.1416) and scientific notation(1.35e-12). Ratios are “fractions” such as 1/3; they are not subject to roundoff error Since \n represents the character n, \newline is used to represent a newline Keywords are not reserved words, but are like “atoms” in Prolog or Erlang; a keyword begins with a colon (:foo) and stands for itself 4 Basic data types II A list is a sequence of values enclosed in parentheses, for example, (one 2 "buckle my shoe") Elements of the list are separated by whitespace or commas A list represents either data or a function call, depending on context A vector is a sequence of values enclosed in brackets, for example, [one 2 "buckle my shoe”] A map or hash is a sequence of key/value pairs, enclosed in braces, for example, {:ace 1, :deuce 2, "trey" 3} Elements of the vector are separated by whitespace or commas Elements are separated by whitespace or commas It is helpful to use commas between key/value pairs A set is a pound sign (#) followed sequence of values enclosed in braces, for example #{a b c} 5 Functions The syntax to define a named function is: (defn function_name docstring? [arguments] The docstring after the function name is optional The value of the function is the value of the last expression evaluated The syntax of a function call is (function_name arguments) expressions) Notice that the name of the function being called is the first thing inside the parentheses The syntax of an anonymous function, or lambda, is (fn [arguments] expressions) This syntax can be used in place of a function name Example: ((fn [lst] (first lst)) my-list) is equivalent to (first my-list) Alternative syntax: #(expressions), where %1 (or just %), %2, %3, … stand for parameters Since functions are first-class values, this syntax is convenient for creating a function to be used as an argument to another function 6 Sequences Lists, vectors, maps, and sets are all “sequences,” and many of the same operations apply (first seq) returns the first element of seq If the seq is empty, the empty list, (), is returned (rest seq) returns the seq without its first element If the seq is empty, the empty list, (), is returned (cons value seq) returns a sequence of the same type with the value inserted as the new first element Some sequences may be lazy; the elements of the sequence are not actually computed until they are used This allows Clojure to have infinite sequences, for example, “all positive integers” 7 Fast and slow operations Sequence is an interface roughly analogous to Java’s Iterator: It is a small interface (a couple of methods) that can be applied to a large range of data types Many (most?) of the common Clojure operations are defined on sequences For example, map and reduce work across almost all data types. Clojure.org describes sequences as “logical lists” The primary sequence operations are first, rest and cons Everything else is built around these operations. The set of cheap (constant time) operations are the same as for lists: Add/remove/access the first element or step to the next element Operations like nth and last are expensive (O(n) time) since they must traverse the entire list. count -- getting the length of a sequence -- is also expensive. 8 Sequence operations Sequence operations can be used on almost any compound type Collections: Vector, list, map, set, struct Strings (as char sequence) Java arrays, collections, sets, maps, iterator, enumerable IO streams Producer functions: Many lazy seqs do not get their values from a backing data set, but from a function that calculates the value when asked for. Some collections can be used directly as sequences, others need to be converted The rules are somewhat different for different kinds of collections, so the simplest general strategy is to convert to sequence, using seq do the processing if you need a particular non-sequence type (like vector or array), convert the return value back 9 Predicates A predicate (in any computer language) is a function that returns either “true” or “false” In Clojure, “false” is one of the atoms false or nil “true” is the atom true In addition, anything that isn’t false or nil is considered to be true In particular, the empty list, (), is “true” Hence, a predicate returns either false, nil, or any other value to mean “true” Predicates often return “true” values other than true, especially if the returned value might be useful 10 Function calls and data A function call is written as a list Example: (println "Hello" "World") The first element is the name of the function Remaining elements are the arguments Calls function println with arguments "Hello" and "World" The function prints Hello World and returns nil Data is written as atoms or lists Example: (println "Hello" "World") is a list of three elements 11 Quoting Is (f a b) a call to f, or is it just data? Lists and atoms must be quoted x indicates the variable x, while (quote x) is just the atom x (quote (f a b)) is the list (f a b) Exceptions include nil, true, and nil, which do not need to be quoted quote is not a function, but a special form The special form gets the unevaluated arguments, and is control of when or whether to evaluate them Special forms, unlike functions, are not first-class values '(f a b) is a shorthand way of doing the same thing There is just one single quote at the beginning It quotes one expression 12 Examples (defn my-first [x] (first x)) user=> (my-first '(1 2 3)) 1 defn my-second [lst] (first (rest lst))) user=> (my-second '(a b c)) b (defn my-third "This is a doc comment" [lst] (first (rest (rest lst))) ) user=> (my-third "Whatever") \a user=> (doc my-third) ------------------------user/my-third ([lst]) This is a doc comment nil 13 Arithmetic + * / Returns the sum of its arguments; (+) returns 0 Subtracts the rest of the numbers from the first number Returns the product of its arguments;(*) returns 1. Divides the rest of the numbers into the first number If operands are integers, result is a ratio If only one number, returns its inverse quot Returns the quot[ient] of integer division of the first number by the rest of the numbers rem remainder of dividing the first number by the second mod Modulus of first and second number; truncates toward negative infinity inc Returns a number one greater than its argument dec Returns a number one less than its argument max Returns the largest of its arguments min Returns the smallest of its arguments 14 Numeric comparisons = Returns true if all arguments are equal This is a true equality test, not an identity test Collections are compared in a type-independent manner; for example, (= '(1 2 3) [1 2 3]) returns true == Returns true if numeric arguments all have the same value, otherwise false not= Same as (not (= obj1 obj2)) < Returns true if numeric arguments are in monotonically increasing order > Returns true if numeric arguments are in monotonically decreasing order, otherwise false <= Returns true if numeric arguments are in monotonically non-decreasing order, otherwise false >= Returns true if numeric arguments are in monotonically non-increasing order, otherwise false 15 Conditional execution if is a special form that takes exactly three arguments: A predicate An expression to evaluate if the predicate is true An expression to evaluate if the predicate is false Syntax: (if Example: predicate true-branch false-branch) (defn sum [lst] (if (= lst ()) 0 (+ (first lst) (sum (rest lst))) ) ) (defn average [lst] (/ (sum lst) (count lst)) ) 16 The problem with if if expressions nest rather awkwardly (defn collatz [n] (println n) (if (= n 1) 1 (if (= (rem n 2) 0) (collatz (/ n 2)) (collatz (inc (* 3 n))) ) ) ) In most cases, the more general cond is preferred 17 cond cond implements the if...then...elseif...then...elseif...then... control structure Like if, cond is a special form, not a function That is, the arguments to cond are not evaluated before cond is called; rather, cond evaluates the arguments as it pleases 18 Syntax of the cond The syntax of the cond special form is: (cond condition result condition result ...) Example: (defn pos-neg-or-zero "Determines whether n is positive, negative, or zero" [n] (cond (< n 0) "negative" (> n 0) "positive" :else "zero" ) ) The last condition, :else, is true, and reads better than the atom true 19 Rules for Recursion Handle the base (“simplest”) cases first Recur only with a “simpler” case “Simpler” = more like the base case Don’t alter global variables Don’t look down into the recursion 20 Example: member As an example we define member, to test membership in a list (defn member [a lat] (cond (empty? lat) false (= a (first lat)) true :else (member a (rest lat)) ) ) user=> (member :b '(:a :b :c)) true 21 Guidelines for recursive Clojure functions Unless the function is trivial, use a cond Handle the base case(s) first Avoid having more than one base case The base case is often testing for an empty list Do something with the first and recur with the rest Use tail recursion wherever possible (To be explained later) 22 Example: union (defn union [set1 set2] (cond (empty? set1) set2 (member (first set1) set2) (union (rest set1) set2) :else (cons (first set1) (union (rest set1) set2)) ) ) This example uses the previously defined member function It appears that, if function A calls function B, Clojure requires function B to be defined before defining function A Of all the languages I have used, only C and its variants make this requirement Unless I’m missing something, this is a real step backward for Lisp dialects 23 Tests The following can be used with any type of argument nil? identical? Returns true if its argument is nil, false otherwise. Tests if its two arguments are the same object The following will throw an exception if given a non-numeric argument zero? pos? neg? even? odd? Returns true if its argument is zero, else false Returns true if its argument is greater than zero Returns true if its argument is less than zero Returns true if its argument is even Returns true if its argument is odd 24 Type tests coll? seq? vector? list? map? set? Returns true if x implements IPersistentCollection Return true if x implements ISeq Return true if x implements IPersistentVector Returns true if x implements IPersistentList Return true if x implements IPersistentMap Returns true if x implements IPersistentSet 25 Content Tests (contains? map key) Returns true if key is present in the map (distinct? args) Returns true if no two of the arguments are = (empty? collection) Returns true if collection has no items - same as (not (seq coll)) Clojure programmers use the idiom (seq x) rather than (not (empty? x)) (every? predicate collection) Returns true if (predicate x) is true for every x in collection (not-every? predicate collection) Returns false if (predicate x) is true for every x in collection (not-any? predicate collection) Returns false if (predicate x) is true for any x in collection (some predicate collection) Returns the first value in the collection for which the predicate is true 26 I/O *in* *out* *err* print, pr printf println prn newline read-line slurp spit A java.io.Reader object representing standard input A java.io.Writer object representing standard output A java.io.Writer object representing standard error Prints the object(s) to the *out* output stream, separated by spaces Prints formatted output, according to a format Same as print followed by newline Same as pr followed by newline. Observes *flush-on-newline* Writes a newline to the output stream *out* Reads the next line from stream *in* Reads the file into a string and returns it Opposite of slurp. Opens file with writer, writes content, then closes it 27 Dealing with side effects Functions compute values; but printing is a side effect The value returned by println is nil—not too useful You can put multiple expressions in a defn The result of calling the function is the result of evaluating the last expression (defn five [] (println "Just five") 5 ) (five) Just five 5 You can also use do; the result is the value of the last expression (do (println "one" "two") (+ 1 2) ) one two 3 28 REPL commands (load-file filename) loads a file containing Clojure functions *1 bound in a repl thread to the most recent value printed *2 bound in a repl thread to the second most recent value printed *3 bound in a repl thread to the third most recent value printed *e bound in a repl thread to the most recent exception caught by the repl *print-dup* When set to logical true, objects will be printed in a way that preserves their type when read in later *print-length* controls how many items of each collection the print *print-level* controls how many levels deep the printer will print *print-meta* If set to logical true, when printing an object, its metadata will also be printed in a form that can be read back by the reader *print-readably* When set to logical false, strings and characters will be printed with non-alphanumeric characters converted to the appropriate escape sequences 29 Dealing with the REPL The last time I used Clojure’s REPL, it seemed to be very buggy I got a lot of errors along these lines: java.lang.Exception: Unable to resolve symbol: union in this context (NO_SOURCE_FILE:43) Here are two workarounds that seem to solve most such problems: (It’s a Java exception because Clojure is compiled to the JVM) Start a new Clojure shell, or Just copy the function definition and paste it directly into the Clojure shell That was a year ago; the new version may be better You might also try out clooj, which is a very simple Clojure IDE https://github.com/arthuredelstein/clooj 30 The End 31