ΑΡΙΘΜΟΣ: Common Lisp

Showing posts with label Common Lisp. Show all posts

Wednesday, July 27, 2022

Using CLSQL in LispWorks 8 (x64)

A few modifications are needed to use CLSQL in LispWorks 8.0. The core changes needed are in ...\quicklisp\dists\quicklisp\software\clsql-20210228-git\uffi\clsql-uffi.lisp. I changed the returning clauses in each of the following (and the function names for #-windows conditions):

#-windows

(uffi:def-function ("_strtoui64" c-strtoull)

((str (* :unsigned-char))

(endptr (* :unsigned-char))

(radix :int))

:returning :uint64)

#-windows

(uffi:def-function ("_strtoi64" c-strtoll)

((str (* :unsigned-char))

(endptr (* :unsigned-char))

(radix :int))

:returning :int64)

#+windows

(uffi:def-function ("_strtoui64" c-strtoull)

((str (* :unsigned-char))

(endptr (* :unsigned-char))

(radix :int))

:returning :uint64)

#+windows

(uffi:def-function ("_strtoi64" c-strtoll)

((str (* :unsigned-char))

(endptr (* :unsigned-char))

(radix :int))

:returning :int64)

LispWorks didn't know what to do with the :unsigned-long-long.

Two other things to try to make sure you are ready to move forward with your databases. If you are using SQLite3 or MySQL, you will need to locate the required dlls (or download them). Use

(push (pathname "C:/path/to/x64")

clsql::*FOREIGN-LIBRARY-SEARCH-PATHS*)

to allow finding the paths to the dlls. Do this for each of the required dlls.

Each type of database can be loaded according to Database Back-ends. For example, SQLite3:

    (asdf:operate 'asdf:load-op 'clsql-sqlite3)

For an asd file that needs SQLite3, this is what I'm currently using after a call to (ql:quickload :clsql) before defsystem:

After attempting to use the create-view-from-class function and finding that it created tables without foreign keys, I decided to use a simple SQL create script. It appears that execute-command only executes the first SQL statement, so it is necessary to split on the semi-colon (;) and then run each statement in sequence. (I am skipping the last "statement" in my results because my script has some gobble-dee-gook at the end.)

Friday, February 12, 2021

EMACS with Slime on Chromebook

Because of a felt-insufficiency of nerdness in my life, I decided it was high time I get set up to use slime in EMACS on a Chromebook. I have been an EMACS user on Windows, using it for my first experience of Common Lisp development. I now use LispWorks on my Windows machine but if there was a way to do Common Lisp on my Chromebook I thought I would hazard some of my time to learn how to go about it.

First, get Linux on your Chromebook. For newer systems, this is as simple as turning on Linux Beta in the user settings of your Chromebook. I don't know how well this works on older Chromebooks, but I understand there are ways.

With Linux enabled or installed as required, go to the terminal (settings key, type linux or terminal and you should see it) and type sudo apt-get install emacs25. I had to run an update to apt-get for this to work properly (sudo apt-get update). Then I redid the emacs25 install (same command line) and it caught up the parts it missed the first time.

Similarly, you can install a version of Common Lisp. I chose Steel Bank Common Lisp (sbcl). The command line for that is (perhaps predictably) sudo apt-get install sbcl.

Quicklisp is next. I downloaded it from here. I moved it into my Linux files because it seemed like a good idea off hand, though I'm not sure that it is strictly necessary. Before trying to load this, I checked out the (rather simple) file structure on my Linux system from the terminal perspective. Use the ls command to see the files in the current directory and the cd command to change directory (I'm not a real Linux user, maybe you're not either). Next, I started sbcl by typing it at the command line. This results in bringing you to a command line version of sbcl where you can type Common Lisp commands. We use this to load Quicklisp. I issue the following, pressing enter after each line:

(load "quicklisp.lisp")

(quicklisp-quickstart:install)

(ql:add-to-init-file)

(ql:quickload "quicklisp-slime-helper")

(quit)

Now, you need some code added to the the init file, but I didn't see such a file and wasn't sure where to put it. Fortunately, there's a way for EMACS to help you with that. Go into EMACS and type CTRL+x CTRL+f, it will assume you want the home directory and lead you with "~/", and so you add to that emacs.d/init.el and press enter. Or, in EMACSese, that's C-x C-f ~/emacs.d/init.el RET. HT to the Wiki. This will create and open the file for you.

The code you will want to put in this file is suggested at the end of installing the quicklisp-slime-helper and it is this:

(load (expand-file-name "~/quicklisp/slime-helper.el"))

(setq inferior-lisp-program "sbcl")

Paste the code into the file you just opened. Now, be prepared for a "wha?" moment. The shortcut sequence for pasting into EMACS is CTRL+y. Or, C-y as the EMACS users like to say. This will not be your last such moment, but carry on.

You are now prepared to start writing and executing Common Lisp code in EMACS. In the below, C- means press and hold CTRL and M- means press and hold ALT. I do the following sequence as a standard way to get started:

C-x 3 (gets you set with vertical division--left and right panes. one for code file and one for REPL)

M-x slime RET (starts the REPL)

Here is the classic video tutorial (HT Baggers) of most of these steps done for Windows that also gives a bit of a flavor of what it is like to use Slime in EMACS for Common Lisp development: Installing Common Lisp, Emacs, Slime & Quicklisp.

Saturday, February 8, 2020

Cluster / Hierarchical Hashing in Common Lisp

I had some application ideas where I wanted to group data by several keys. Sometimes I care about the hierarchical relationship and sometimes I don't. As an example, let's take my FitNotes exercise data. Here's a few records from way back in the day (I can hardly believe the numbers, ugh...a few years and many pounds ago):

Date	Exercise	Category	Weight (lbs)	Reps
2015-03-10	Deadlift	Back	130	10
2015-03-10	Deadlift	Back	130	10
2015-03-10	Deadlift	Back	130	10
2015-03-10	Chin Up	Back	205	4
2015-03-10	One-Arm Standing Dumbbell Press	Shoulders	25	12
2015-03-10	One-Arm Standing Dumbbell Press	Shoulders	25	12
2015-03-10	One-Arm Standing Dumbbell Press	Shoulders	25	12
2015-03-10	Flat Dumbbell Fly	Chest	30	10
2015-03-10	Flat Dumbbell Fly	Chest	30	10
2015-03-14	Deadlift	Back	135	10

If I wanted to show a history of data and provide the user with the option of seeing a graph of their progress (similar to what the FitNotes application does) or perhaps a list of dates to select from to see the details on that date, then I would group first by Exercise and then by date. Depending on the purpose of the grouping, I might want to group by Category first. But let's go with a graphing application for a particular exercise. I want to group by Exercise and then by date.

I create hash tables at each level except the bottom level where I create a queue. The main thing that interested me here was the setf function, since the syntax bamboozled me the first time I saw in Practical Common Lisp (it's not hard, just different). I ended up not needing it for my current application, but may want it another day. For the kind of data I'm working with you need to use the equal test in order to get your hash tables to work with strings. If you compare (setf get-cluster-hash) with add-to-cluster-hash, you'll notice they are very similar. I was tempted to make add-to-cluster-hash out of the setf and get-cluster-hash functions but realized that that entails evaluating the hashes twice for adding the first record to a cluster-key. There might be a way to eliminate the code duplication, but I gave in to copy-paste-modify. (Please don't hate me.)

(defun make-queue ()
  (let ((li (cons nil nil)))
    (setf (car li) (cons nil nil))
    (setf (cdr li) (car li))
    li))

;; we add to the end and maintain the pointer
(defun add-queue (queue item)
  (setf (cddr queue) (cons item nil))
  (setf (cdr queue) (cddr queue))
  queue)

(defun queue->list (queue)
  (cdar queue))

(defun (setf get-cluster-hash) (value keys hash)
  (let* ((f (first keys))
         (r (rest keys)))
    (cond
     ((null f)
      nil)
     ((null r) 
      (setf (gethash f hash) value))
     (t
      (let* ((f-hash (gethash f hash)))
        (unless f-hash
          (setf f-hash (make-hash-table :test #'equal))
          (setf (gethash f hash) f-hash))
        (setf (get-cluster-hash r f-hash) value))))))

(defun get-cluster-hash (keys hash)
  (loop for k in keys
        for k-hash = (gethash k hash) then (gethash k k-hash)
        while k-hash
        finally (return k-hash)))

(defun add-to-cluster-hash (keys hash value)
  (let* ((f (first keys))
         (r (rest keys)))
    (cond
     ((null f)
      nil)
     ((null r) 
      (let ((q (gethash f hash)))
        (if q
            (add-queue q value)
          (let ((nq (make-queue)))
            (add-queue nq value)
            (setf (gethash f hash) nq)))))
     (t
      (let* ((f-hash (gethash f hash)))
        (unless f-hash
          (setf f-hash (make-hash-table :test #'equal))
          (setf (gethash f hash) f-hash))
        (add-to-cluster-hash r f-hash value))))))

Here is a simple demonstration REPL session with some of these functions:

Since I have the data of interest in a CSV file exported from FitNotes, I can just read every line of the CSV file and use add-to-cluster-hash for each line, thus grouping my data without loosing the order of the items for each (exercise, date), and all without thinking much about the details of the data structure used to do the grouping.

Friday, January 3, 2020

Macro Expansion Time: Connection Strings

Connection strings bother me. I'm not saying that aren't a good solution or that they shouldn't be, but I don't like having to look up special values to fill in and so on. I think the reason connection strings exist is that the available options for drivers, servers, and several other settings is unknown. There is no definitive list of all the potential types of things that need to go into a connection string. There are, however, common patterns of connection strings and values that are normally required. There's a website devoted to connection strings (https://www.connectionstrings.com/) and as far as being a very generic resource for helping you with your connection string needs, this goes a long way.

But it would nice to be able to maintain a list of connection string values that apply to various connections or connection types that matter to me so that after I have learned the values I need for certain types of connections I can represent them with a keyword and if I use a bad keyword (e.g., typo), get a warning at compile time of that fact. In the process of starting a very basic macro framework for doing this, I ran into a bump in my understanding of the macro expansion process and I thought that made it worth sharing more so than the connection string macros themselves. I'm learning to use the clsql package (available with quicklisp), which comes into play with one of the macros.

Two notable features that these macros will demonstrate are:

the explicit expansion of macros within code that produces code
conditional code production based on the type of data supplied

In particular, variables versus literals versus keywords

The first thing I want is to have my own repository of special values for each type of information that I care about. For my present interests I am concerned with drivers and servers. You might add to this specific databases. We don't want to get stuck with having to use keywords only because this would require having every case we will ever care about in the repository or require the repository to be updated before you can call the macro, which largely defeats the intended convenience of the macros. We also don't want the look up to happen during run-time where we can avoid it so that our convenience function doesn't add operations to run-time that would have been unnecessary if we had used a less convenient literal connection string. (Sometimes we have to decide between clarity/convenience of expression on the one hand and efficiency on the other. In this case, we will try to have our cake and eat it too.)

The way we handle our repositories is with a plist and the macros for each category of thing could be similar. I show only the drivers one since the server version is not materially different. (That probably means I could take the abstraction another level to remove the repetition.)

If a literal keyword is supplied, we will get the magic string that it corresponds to. We assume that if the user has supplied a keyword that isn't in the list, that they have made a mistake and therefore issue an error. This error is raised during macro expansion time which happens prior to compilation. If it is not a keyword, then we don't try to determine anything about its validity—we let the caller take their chances. It might be a literal string or it might be a variable, but that isn't this macro's problem.

Our next macro has some more interesting features, although it is basically a glorified concatenation. One notable feature is that we will pass on requiring keyword arguments and simply use &rest and recognize what we can and let everything else pass through without a lot of scrutiny. This is a choice to avoid doing research to support cases for my abstraction that I may never use. As the maintainer of this code for myself, I can always update this macro to support additional cases in the cond statement when I come across a new requirement that I actually want to use. In general, we are expecting keywords followed by either values or keywords that apply to the keywords, but we can accept strings in place of any of the keywords. We take consecutive pairs of parameters and consider the first of a pair to be the left hand of the = and second of the pair to be on the right of the = in our connection string.

(defmacro make-connection-string (&rest rest)
  (destructuring-bind (bound-list unbound-list)
      (loop for pair on rest by #'cddr
            for a = (car pair)
            for b = (cadr pair)
            for (a-name b-value) =
            (cond
	     ((eql a :driver) (list "Driver" (macroexpand `(typical-drivers ,b))))
	     ((eql a :server) (list "Server" (macroexpand `(typical-servers ,b))))
	     ((eql a :trusted) (list "Trusted_Connection"
				     (if b "Yes" "No")))
	     (t (list (format nil "~a" a) b)))
            if (or (not (symbolp b-value))
                   (boundp b-value)) collect (cons a-name b-value) into bound
            else collect (cons a-name b-value) into unbound
            finally (return (list bound unbound)))
    (let ((bound-part
	   (apply #'concatenate 'string
		  (mapcar #'(lambda(u) (format nil "~a=~a;" (car u) (cdr u)))
			  bound-list))))
      (cond
       ((null unbound-list) bound-part)
       (t
        `(concatenate 'string
                      ,bound-part
                      (concatenate 'string
                                   (mapcar
                                    #'(lambda(u) (format nil "~a=~a" (car u) (cdr u)))
                                    ,unbound-list))))))))

Here's a sample usage:

(Note that you can use the function write-line to convince yourself about the rectitude of the escaped backslash.)

First note the explicit calls to macroexpand with the macro. The key to understanding why this is necessary it to think about the order of execution. Suppose instead of (list "Driver" (macroexpand `(typical-drivers ,b))) we wrote simply (list "Driver" (typical-drivers b)). When the macro is expanded, it expands the macro typical-drivers without having a specific value for b supplied and as such, it is an unbound variable called b which gets passed into typical-drivers. So, the code (typical-drivers b) gets expanded to be just b. Not very useful, is it? The macroexpand wrapper allows us to delay the execution of the expansion until b is bound to a value and the , notation puts the bound value of b in here and we get the desired result.

As we go through the list of pairs, we restructure them into an association list of (left side, right side) cons cells. We want to distinguish between two kinds of pairs. Bound and non-bound. The reason this distinction matters to us is that the bound items already have values and we can put them in place right away so that we don't have to wait until later to do the final concatenation for this part of the code. If there is no unbound part, then we will have a simple, complete connection string that will in our compiled code. If there is an unbound part, then we need to return code that concatenates the pieces. It is the responsibility of the caller to provide a way for the supplied variables to be bound to a value before entering into the code that is thus generated. A wrinkle in my terminology is that bound is something that applies to symbols. If I pass in something that is not a symbol (which I will do often), then I am going to assume, for simplicity, that it is a concrete and directly usable value and hence "bound" in the sense of this macro.

Finally, my last macro is the usage of this connection string stuff to connect to a database using clsql, where my own original exercise began:

Friday, May 24, 2019

Recursion and Accumulators

Working through Paradigms in Artificial Intelligence Programming, I have worked to wrap my head around accumulators and I have implemented one that worked surprisingly quickly. My example is a factorial design experiment. I have 4 reference implementations of a factorial design on an experiment with 7 factors, with each factor having 10 treatments in it. This is a bit of a ridiculous size, but it gave me clear separation of performances. I have split the code up into sections so you can focus on the parts of the program that matter you.

Supporting Code

(defmacro make-enumerator (head tail)
  `(cons ,head #'(lambda () ,tail)))

(defun get-current (enumerator)
  (if (consp enumerator)
      (first enumerator)
      nil))

(defun get-next (enumerator)
  (when (functionp (rest enumerator))
      (funcall (rest enumerator))))

(defvar empty-indices '(nil))

;; this is a non-destructive implmentation
(defun multiple-radix-counter (radices &key indices terminate)
  (when (null indices)
    (setf indices (make-array (length radices) :initial-element 0)))
  (if (and terminate (equal indices empty-indices))
      nil
      (make-enumerator
       indices
       (multiple-radix-counter
 radices
 :indices
 (loop with dest = (make-array (length indices) :initial-contents indices)
    for i from (1- (length indices)) downto 0 
    if (< (1+ (aref dest i)) (aref radices i)) do
      (incf (aref dest i))
      (return dest)
    else do
      (setf (aref dest i) 0)
    finally
      (return (if (and terminate (< i 0))
    empty-indices
    dest)))
 :terminate terminate))))

(defun extract-by-multiple-radix (source indices)
  (loop with res = (make-array (length indices))
     for i from 0 to (1- (length indices))
     do (setf (aref res i) (aref (aref source i) (aref indices i)))
     finally (return res)))

(defmacro discard (&body body)
  `(let ()
     ,@body
     nil))

Main Functions

(defun factorialize (factors &optional (accumulator '(nil)))
  "Given a list of lists, where each list is of treatments, give a complete factorial."
  (cond ((null factors) accumulator)
 ((atom factors) (mapcar #'(lambda (u) (cons factors u)) accumulator))
 ((atom (first factors))
  (loop with res = '(nil)
     with ending = res
     for f in factors do
       (loop for acc in accumulator do
     (setf (cdr ending) (cons (cons f acc) nil))
     (setf ending (cdr ending)))
     finally
       (pop res)
       (return res)))
 (t (factorialize (first factors)
    (factorialize (rest factors) accumulator))))

(defun factorialize-2 (factors)
  "This is the slow version using append.
sample usage: (factorialize-2 '((1 2 3) (4 5 6) (a b c)))
produces: ((1 4 A) (1 4 B) (1 4 C) (1 5 A) (1 5 B) (1 5 C) (1 6 A) (1 6 B) (1 6 C) (2 4 A) (2 4 B) (2 4 C) (2 5 A) (2 5 B) (2 5 C) (2 6 A) (2 6 B) (2 6 C) (3 4 A) (3 4 B) (3 4 C) (3 5 A) (3 5 B) (3 5 C) (3 6 A) (3 6 B) (3 6 C))"
  (cond ((null factors) nil)
 ((null (rest factors)) (mapcar #'(lambda (u) (list u)) (first factors)))
 (t (loop with remaining = (factorialize-2 (rest factors))
       for f in (first factors) append
  (loop for r in remaining collect (cons f r))))))

(defun factorialize-3 ()
  (loop with res = '(nil)
     with ending = res
     for i from 0 to 9 do
       (loop for j from 0 to 9 do
     (loop for k from 0 to 9 do
   (loop for l from 0 to 9 do
        (loop for m from 0 to 9 do
      (loop for n from 0 to 9 do
    (loop for p from 0 to 9 do
         (setf (cdr ending) (cons (list i j k l m n p) nil))
         (setf ending (cdr ending))))))))
     finally
       (pop res)
       (return res)))

(defun factorialize-4 (factors)
  (let* ((radices (map 'vector #'length factors))
  (combinations (reduce #'* radices))
  (res (make-array (list combinations))))
    (loop for enum = (multiple-radix-counter radices :terminate t) then (get-next enum)
       for index = 0 then (1+ index)
       for indices = (get-current enum)
       while enum
       do (setf (aref res index) (extract-by-multiple-radix factors indices))
       finally
  (return res))))

Speed Testing

To make the testing as fair as possible, I started emacs fresh for each test run. Under these conditions, factorialize took 2.5 seconds, factorialize-2 took 3.7 seconds, factorialize-3 took 22 seconds, and factorialize-4 took 12 seconds.

A large amount of the time spent in these examples is spend in garbage collection. The factorialize-4 example could probably be improved by allocating a large chunk of memory in a 2-dimensional array and then the rest of the work involves a lot less memory allocation time. The land slide difference that marks factorialize off from the other methods appears to be the amount of shared memory it involves. Some reassignments of various locations of the resulting list, produce changes in other lists since there is shared cons cells among the lists. For my particular application this was proving to be a problem and so I have changed over to the factorialize-4 version of the function. Since my actual likely data is much smaller than that used in the speed tests of this post, I'm not overly concerned about the difference in speed.

An advantage of the method of factorialize-4, which uses an enumerator is that this method could be used to output data to a database or file in a way that didn't need to use very much run-time memory. Very likely in these cases, the time spent sending and saving data to persistent storage is much greater than the computational effort. The factorialize and factorialize-2 methods do not lend themselves to this as they do not generate complete tuples until all of the tuples are created. factorialize-4 creates complete tuples one at at time and these can be acted on immediately without waiting for all of the output to be generated.

Monday, May 6, 2019

Common Lisp Pipes→Enumerators

I've been plodding along through Peter Norvig's Paradigm's in Artificial Intelligence Programming and finding some helpful tips along the way. I especially enjoyed the discussion of pipes in chapter 9. I was intrigued by the pipes implementation of the sieve or Eratosthenes and wondered what else I could do with pipes.

Something that I didn't like about pipes is that they keep around a list of the results, when maybe you are actually done with those results. I was taking combinations of factors to produce a factorial experimental design and I decided to do so using something I have called multiple-radix-counting. Multiple-radix, meaning the radix (or base) for each digit is different. Imagine an odometer where each spinner has a different number of marks on it. Instead of always going from 0 to 9, some of the spinners might be 0 to 1 or 0 to 5. The sequencing is the same. When you get the end of the right most spinner, you wrap around on the first and advance the next spinner. Here's some sample output the shows the pattern:

I have taken the definitions from PIAP and modified them to avoid hanging on to the part of the pipe that has gone past. I have renamed certain functions/macros and made minor changes to them to produce a result that behaves differently. Instead of make-pipe, I have make-enumerator, although the macro is identical. It's important that I have a new name for this macro that suits the different usage in case I decide to implement enumerators differently in the future. The function get-current and get-next are the replacements for head and tail, respectively. The representation is very similar to that used for pipes. I use a cons cell where the car is the current value and the cdr is a function call.

(defmacro make-enumerator (head tail)
  `(cons ,head #'(lambda () ,tail)))

(defun get-current (enumerator)
  (if (consp enumerator)
      (first enumerator)
      nil))

(defun get-next (enumerator)
  (when (functionp (rest enumerator))
      (funcall (rest enumerator))))

(defun integer-enum (&optional (start 0) end)
  (if (or (null end) (<= start end))
      (make-enumerator start (integer-enum (1+ start) end))
      nil))

(defvar empty-indices '(nil))

The purpose of the extract-by-multiple-radix is to take a selection out of source (an array of arrays) based on the indices that were specified. If you were doing multiplication of multiple term factors, you would take the terms by index from each factor and multiply them. To get the factors based on index, you would use extract-by-multiple-radix.

ΑΡΙΘΜΟΣ

Pages

Wednesday, July 27, 2022

Using CLSQL in LispWorks 8 (x64)

Friday, February 12, 2021

EMACS with Slime on Chromebook

Saturday, February 8, 2020

Cluster / Hierarchical Hashing in Common Lisp

Friday, January 3, 2020

Macro Expansion Time: Connection Strings

Friday, May 24, 2019

Recursion and Accumulators

Supporting Code

Main Functions

Speed Testing

Monday, May 6, 2019

Common Lisp Pipes→Enumerators

Blog Archive

Most Viewed