Postgresql
Description
Bindings for PostgreSQL's C-api.
Author
Original author: Johannes Grødem
Please do not mail to Johannes directly as he no longer develops this egg.
Current maintainer: Peter Bex
Thanks to
- Felix L. Winkelmann
- Alex Shinn
- Ed Watkeys
- Taylor Campbell
Repository
https://code.more-magic.net/chicken-postgresql
Requirements
You will also need to have libpq installed, including development headers.
Documentation
This extension provides an interface to the PostgreSQL relational database.
Connection management
[procedure] (connect [CONNECTION-SPEC [TYPE-PARSERS [TYPE-UNPARSERS [NOTIFY-HANDLER]]]])Opens a connection to the database given in CONNECTION-SPEC, which should be either a PostgreSQL connection string or an alist with entries consisting of a symbol and a value. The symbols should be connection keywords recognized by PostgreSQL's connection function. See the list of PQconnectdbParams parameter keywords in the PostgreSQL documentation. At the time of writing, they are host, hostaddr, port, dbname, user, password, connect_timeout, options, sslmode, service.
If any parameter is not specified, or the CONNECTION-SPEC was not supplied at all, the regular libpq rules apply. That means it falls back to checking various environment variables and settings from pg_service.conf, as well as passwords from pgpass. This is to be preferred over hardcoding connection settings in your code.
Using an alist for CONNECTION-SPEC is recommended; when available (when using libpq from Postgres 9.0 or later), PQconnectStartParams will be used. This prevents parsing errors when keys or values contain "special" characters like equals signs or single quotes. This also adds a layer of security for when connection specifier components come from an untrusted source.
TYPE-PARSERS is an optional alist that maps PostgreSQL type names to parser procedures, TYPE-UNPARSERS is an optional alist that maps predicates to unparser procedures. They default to (default-type-parsers) and (default-type-unparsers), respectively (see below).
NOTIFY-HANDLER is an optional procedure which should accept three arguments: A channel name (string), the triggering backend's PID (integer) and the payload (string). It is invoked whenever there are asynchronous notifications available through LISTEN. It defaults to (default-notify-handler) (see below). It may also be #f to disable handling of notifications.
The return value is a connection-object.
Important: You cannot use the same connection from multiple threads. If you need to talk to the same server from different threads, simply create a second connection.
Also note that while these bindings use the non-blocking interface to connect to PostgreSQL, if you specify a hostname (using the host-keyword), the function might not be able to yield because the resolver will block. If you don't want this, you'd have to call some kind of custom asynchronous resolver.
[procedure] (disconnect CONNECTION)Disconnects from the given CONNECTION.
[procedure] (reset-connection CONNECTION)Resets, that is, reopens the connection with the same connection-specs as was given when opening the original CONNECTION.
[procedure] (type-parsers CONNECTION)Retrieve the alist of type parsers associated with the CONNECTION.
[procedure] (type-unparsers CONNECTION)Retrieve the alist of type unparsers associated with the CONNECTION.
[procedure] (connection? OBJECT)Returns true if OBJECT is a PostgreSQL connection-object.
[procedure] (connected? CONNECTION)Returns true if the CONNECTION represents an open connection.
Query procedures
[procedure] (query CONN QUERY . PARAMS)Execute QUERY on connection CONN and return a result object. The result object can be read out by several procedures, ranging from the low-level value-at to the high-level row-fold. See the High-level API and Low-level result API sections below for information on how to read out result objects.
QUERY can either be a string or a symbol. If it is a string, it should contain an SQL query to execute. This query can contain placeholders like $1, $2 etc, which refer to positional arguments in PARAMS. For example:
(import postgresql) (let ((conn (connect '((dbname . test))))) (row-values (query conn "SELECT $1::text, 2::int2" "hello"))) => ("hello" 2)
If QUERY is a symbol, it must match the name of a prepared statement you created earlier. The number of parameters passed as PARAMS must match the number of placeholders used when the statement was prepared.
To actually create a prepared statement, you can simply use the query procedure with an SQL PREPARE statement. The placeholders in that statement are deferred until execute time. It allows no parameters to be sent at preparation time, which is a limitation in the PostgreSQL protocol itself. You could use escape-string if you really must pass in dynamic values when preparing the statement.
(import postgresql) (let ((conn (connect '((dbname . test))))) ;; Can't pass any arguments here: (query conn "PREPARE mystmt (text, int) AS SELECT $1, $2") ;; They are deferred until statement execution time: (row-values (query conn 'mystmt "hi" 42))) => ("hi" 42) ; a list of all values at row 0 (see "row-values" below) (let ((conn (connect '((dbname . test))))) ;; If we absolutely need dynamic values, we can escape them manually: (query conn (sprintf "PREPARE mystmt (text) AS SELECT $1, '~A', 'bar'" (escape-string conn "foo"))) (row-values (query conn 'mystmt "hi"))) => ("hi" "foo" "bar")
As you can see from the examples above, PostgreSQL types are automatically converted to corresponding Scheme types. This can be extended to support your own user-defined types, see the section about type-conversion below for more information on how to do that.
[procedure] (query* CONN QUERY [PARAMS] [format: FORMAT] [raw: RAW?])A less convenient but slightly more powerful version of the query procedure; PARAMS must now be a list (instead of rest-arguments). FORMAT is a symbol specifying how to return the resultset: either as binary or text (the default). RAW is a boolean which defines whether the PARAMS should be treated "raw" or first passed through the unparsers associated with CONN. If they are treated "raw", they must all be strings, bytevectors or sql-null objects.
See type conversion for more info on unparsers.
(import postgresql (scheme base)) (let ((conn (connect '((dbname . test))))) (row-map* (lambda (a b) (list (utf8->string a) b)) (query* conn "SELECT $1::text, 2::int2" '("hello") format: 'binary))) => (("hello" #u8(0 2)))
High-level API
Usually you will use only these procedures to process result sets, but you can fall back to (or even mix with) the low-level API if you need to do custom things.
[procedure] (row-fold KONS KNIL RESULT)[procedure] (row-fold* KONS KNIL RESULT)
This is the fundamental result set iterator. It calls (kons row seed) for every row, where row is the list of values in the current row and seed is the accumulated result from previous calls (initially knil), ie its pattern looks like (KONS ROWN ... (KONS ROW2 (KONS ROW1 KNIL))). It returns the final accumulated result.
The starred version works the same, except it calls (kons rowN-col1 rowN-col2 ... seed) instead of (kons rowN seed), so the procedure must know how many columns you have in the result set.
(import postgresql) (let ((conn (connect '((dbname . test))))) (row-fold (lambda (row sum) (+ (car row) sum)) 0 (query conn "SELECT 1 UNION SELECT 2"))) => 3 (let ((conn (connect '((dbname . test))))) (row-fold* (lambda (value str) (string-append str value)) "" (query conn "SELECT 'hello, ' UNION SELECT 'world'"))) => "hello, world"[procedure] (column-fold KONS KNIL RESULT)
[procedure] (column-fold* KONS KNIL RESULT)
As row-fold/row-fold*, except this iterates sideways through the columns instead of lengthways through the columns, calling KONS with all values in all the rows of the current column, from left to right.
The starred version is much less useful here since you often don't know the number of returned columns, but it is provided for consistency.
(import postgresql) (let ((conn (connect '((dbname . test))))) (column-fold (lambda (col sum) (+ (car col) sum)) 0 (query conn "SELECT 1, 100 UNION SELECT 2, 200"))) => 101[procedure] (row-fold-right KONS KNIL RESULT)
[procedure] (row-fold-right* KONS KNIL RESULT)
The fundamental result set recursion operator; Calls (KONS COL1 (KONS COL2 (KONS ... KNIL))) instead of (KONS COLN ... (KONS COL2 (KONS COL1 KNIL))).
[procedure] (column-fold-right KONS KNIL RESULT)[procedure] (column-fold-right* KONS KNIL RESULT)
Column variants of row-fold-right/row-fold-right*.
[procedure] (row-for-each PROC RESULT)[procedure] (row-for-each* PROC RESULT)
Simple for-each, calling the (PROC row) on each row, in turn, only for the purpose of its side-effects. The starred version calls (PROC col1 col2 ...).
(import postgresql) (let ((conn (connect '((dbname . test))))) (row-for-each write (query conn "SELECT 1, 100 UNION SELECT 2, 200"))) ; prints: (1 100) (2 200)[procedure] (column-for-each PROC RESULT)
[procedure] (column-for-each* PROC RESULT)
Column variants of row-for-each/row-for-each*.
[procedure] (row-map PROC RESULT)[procedure] (row-map* PROC RESULT)
Maps rows to lists by applying PROC to every row and using its result in the result list on the position corresponding to that of the row. This procedure is not guaranteed to walk the result set in any particular order, so do not rely on the order PROC will be called.
(import postgresql) (let ((conn (connect '((dbname . test))))) (row-map* + (query conn "SELECT 1, 100 UNION SELECT 2, 200"))) => (101 202)[procedure] (column-map PROC RESULT)
[procedure] (column-map* PROC RESULT)
Column variants of row-map/row-map*.
(import postgresql) (let ((conn (connect '((dbname . test))))) (column-map* + (query conn "SELECT 1, 100 UNION SELECT 2, 200"))) => (3 300)
Transaction management
[procedure] (with-transaction CONN THUNK [isolation: LEVEL] [access: MODE])Execute THUNK within a BEGIN TRANSACTION block, and return the value of thunk.
The transaction is committed if thunk returns a true value. If an exception occurs during thunk, or thunk returns #f, or the commit fails, the transaction will be rolled back. If this rollback fails, that is a critical error and you should likely abort.
Nested applications of with-transaction are supported -- only those statements executed within THUNK are committed or rolled back by any with-transaction call, as you would expect.
However, escaping or re-entering the dynamic extent of thunk will not commit or rollback the in-progress transaction, so it is highly discouraged to jump out of a transaction. You will definitely run into trouble, unless you can ensure that no other statements will be executed on this connection until the outermost with-transaction returns normally.
If you provide LEVEL (which can be the symbol read-committed or serializable) this will set the transaction isolation mode for the transaction. If you provide MODE (which can be the symbol read/write or read-only) this will set the access mode for the transaction.
LEVEL is only allowed in the outermost transaction (when in-transaction? returns #f); if you provide it in an inner transaction, an error is raised. In subtransactions, MODE can only be "downgraded" to read-only from inside a read/write transaction, but you can't "upgrade" to read/write from a read-only transaction.
[procedure] (in-transaction? CONN)Returns #t if there is currently a transaction in progress on the connection CONN. Returns #f if no transaction is in progress.
Low-level result API
This API allows you to inspect result objects on the individual row and column level.
[procedure] (result? OBJ)Returns #t when OBJ is a result object, #f otherwise.
[procedure] (clear-result! RES)Directly clean up all memory used by the result object. This is normally deferred until garbage collection, but it's made available for when you want more control over when results are released.
[procedure] (value-at RES [COLUMN [ROW]] [raw: RAW])Returns the value at the specified COLUMN and ROW. It is parsed by an appropriate parser unless RAW is specified and #t. If RAW is true, the value is either a string, bytevector or an sql-null object. Otherwise, it depends on the parsers.
If ROW or COLUMN are not specified, they default to zero. This makes for more convenient syntax if you're just reading out a result of a query which always has one row or even one value.
See type conversion for more info on parsers.
[procedure] (row-values RES [ROW] [raw: RAW])Returns a list of all the columns' values at the given ROW number. If ROW is omitted, it defaults to zero. If RAW is true, the values are either strings, bytevectors or sql-null objects. Otherwise, it depends on the parsers.
[procedure] (column-values RES [COLUMN] [raw: RAW])Returns a list of all the rows' values at the given COLUMN number. If COLUMN is omitted, it defaults to zero. If RAW is true, the values are either strings, bytevectors or sql-null objects. Otherwise, it depends on the parsers.
[procedure] (row-alist RES [ROW] [raw: RAW])Returns an alist of the values at the given ROW number. The keys of the alist are made up by the matching column names, as symbols.
If ROW is omitted, it defaults to zero. If RAW is true, the values are either strings, bytevectors or sql-null objects. Otherwise, it depends on the parsers.
[procedure] (affected-rows RES)For INSERT or UPDATE statements, this returns the number of rows affected by the statement that RES is a result for. Otherwise it's zero.
[procedure] (inserted-oid RES)For INSERT statements resulting in a single record being inserted, this returns the OID (a number) assigned to the newly inserted row. Returns #f for non-INSERT or multi-row INSERTs, or INSERTs into tables without OIDs.
[procedure] (row-count RES)Returns the number of rows in the result set.
[procedure] (column-count RES)Returns the number of columns in the result set.
[procedure] (column-index RES COLUMN)Returns the index of COLUMN in the result set. COLUMN should be a symbol indicating the column name.
[procedure] (column-name RES INDEX)Returns the name of the column (a symbol) at the position in the result set specified by INDEX. This is its aliased name in the result set.
[procedure] (column-names RES)Returns a list of all the column names (symbols) in the result set. The position in the list reflects the position of the column in the result set.
[procedure] (column-format RES INDEX)Returns the format of the column at INDEX, which is a symbol; Either text or binary. This determines whether the value returned by value-at will be a string or a bytevector.
[procedure] (column-type RES INDEX)Returns the OID (an integer) of the column at INDEX.
[procedure] (column-type-modifier RES INDEX)Returns an type-specific modifier (a number), or #f if the type has no modifier.
[procedure] (table-oid RES INDEX)The OID (a number) of the table from whence the result column at INDEX originated, or #f if the column is not a simple reference to a table column.
[procedure] (table-column-index RES INDEX)Returns the column number (within its table) of the column making up the query result column at the position specified by INDEX.
Note: This procedure returns indexes starting at zero, as one would expect. However, the underlying C function PQftablecol is one-based. This might trip up experienced Postgres hackers.
Value escaping
To embed arbitrary values in query strings, you must escape them first, to protect yourself from SQL injection bugs. This is not required if you use positional arguments (the PARAMS arguments in the query procedures).
[procedure] (escape-string CONNECTION STRING)Escapes special characters in STRING which are otherwise interpreted by the SQL parser, obeying the CONNECTION's encoding and escaping settings, using the escaping syntax for string contexts. This does NOT add surrounding quotes to the string; that's up to you to add.
Example:
;; This prevents people from changing a query's parse tree. ;; For example, they could try to turn a query like ;; SELECT * FROM USERS WHERE id='x' ;; into ;; SELECT * FROM USERS WHERE id='1' OR '1'='1' ;; by quoting the value for X, you get the intended parse tree: ;; SELECT * FROM USERS WHERE id='1''' OR ''1''=''1' (escape-string conn "1' OR '1'='1") => "1'' OR ''1''=''1" ;; Depending on the value of standard_conforming_strings you might also get (escape-string conn "1' OR '1'='1") => "1\\' OR \\'1\\'=\\'1" ;; Of course, when using these strings you still need to surround ;; the output of escape-string with single quotes[procedure] (quote-identifier CONNECTION STRING)
Escapes special characters in STRING which are otherwise interpreted by the SQL parser, obeying the CONNECTION's encoding settings and escaping settings, using the escaping syntax for identifier context. Identifiers are table names, aliases etc. Surrounding double quotes will be added.
This procedure corresponds to PQescapeIdentifier, but the name was changed to reflect the fact that it performs escaping and adds quotation marks around the string.
NOTE: This procedure is only available when the egg is built against the libpq from PostgreSQL 9.0 or later. If you are using an older version, this will raise a (exn postgresql unsupported-version) error with an upgrade message.
Example:
;; Spaces are normally not allowed in table names, but when you ;; quote them, they are allowed (quote-identifier conn "a table with spaces") => "\"a table with spaces\"" ;; Can't use a column or table called order because it is a ;; reserved word. However, escaping it makes it usable. (quote-identifier conn "order") => "\"order\"" ;; Table names are case-insensitive and always implicitly downcased. ;; If you need to access a table with a capital in its name, ;; quoting the table also helps: (quote-identifier conn "Foo") => "\"Foo\""[procedure] (escape-bytea CONNECTION OBJ)
Quotes special characters in OBJ (a string, or bytevector/u8vector), which would otherwise be interpreted by the SQL parser. This differs from escape-string in that some bytes are doubly encoded so they can be used for bytea columns.
This is required because of a technicality; PostgreSQL first parses the string value as a string, and then casts this string to bytea, interpreting another layer of escape codes.
For example, E'a\\000bcd' is first converted to 'a\000bcd' by the text parser, and then interpreted by the bytea parser as an "a" followed by a NUL byte, followed by "bcd". In Scheme, the value returned by (escape-bytea conn "a\x00bcd") is "a\\\\000bcd". Yes, that's a lot of backslashes :)
[procedure] (unescape-bytea STRING)This unescapes a bytea result from the server. It is not the inverse of escape-bytea, because string values returned by the server are not escaped for the text-parser. (ie, step one in the encoding process described under escape-bytea is skipped)
The result type of unescape-bytea is a u8vector (in earlier versions, this was a string).
COPY support
High-level COPY API
This API is experimental and as such should be expected to change. If you have suggestions on how to improve the API, please let me know!
[procedure] (copy-query-fold KONS KNIL CONNECTION QUERY . PARAMS)[procedure] (copy-query*-fold KONS KNIL CONNECTION QUERY [PARAMS] [format: FORMAT] [raw: RAW?])
This is the fundamental COPY TO STDOUT iterator. It calls (kons data seed) for every row of COPY data returned by QUERY, where data is either a string or a bytevector depending on whether the COPY query asked for binary or text data and seed is the accumulated result from previous calls (initially knil), ie its pattern looks like (KONS DATAN ... (KONS DATA2 (KONS DATA1 KNIL))). It returns the final accumulated result.
The starred and nonstarred version are analogous to query and query*.
(import postgresql srfi-13) (let ((conn (connect '((dbname . test))))) (copy-query-fold (lambda (data sum) ;; Note the string-drop-right is necessary because the default ;; COPY format uses newlines to indicate row endings (+ (string->number (string-drop-right data 1)) sum)) 0 conn "COPY (SELECT 1 UNION SELECT 2) TO STDOUT")) => 3[procedure] (copy-query-fold-right KONS KNIL CONNECTION . PARAMS)
[procedure] (copy-query*-fold-right KONS KNIL CONNECTION [PARAMS] [format: FORMAT] [raw: RAW?])
The fundamental COPY TO STDOUT recursion operator; Calls (KONS DATA1 (KONS DATA2 (KONS ... KNIL))) instead of (KONS DATAN ... (KONS DATA2 (KONS DATA1 KNIL))).
Warning: It is not recommended to use this when the returned data is very big. It is usually much cheaper (memory-wise) to use copy-query-fold and reverse the result object, if the object's type supports that.
The starred and nonstarred version are analogous to query and query*.
[procedure] (copy-query-map PROC CONNECTION . PARAMS)[procedure] (copy-query*-map PROC CONNECTION [PARAMS] [format: FORMAT] [raw: RAW?])
Maps COPY TO STDOUT output rows from QUERY to lists by calling PROC on each data row returned by the server. If the QUERY asked for binary data, the data supplied to PROC will be in bytevector form. Otherwise, the data will be provided as strings.
The starred and nonstarred version are analogous to query and query*.
(import postgresql (chicken string)) (let ((conn (connect '((dbname . test))))) (copy-map (lambda (x) (apply + (map string->number (string-split x)))) conn "COPY (SELECT 1, 100 UNION SELECT 2, 200) TO STDOUT")) => (101 202)[procedure] (copy-query-for-each PROC CONNECTION . PARAMS)
[procedure] (copy-query*-for-each PROC CONNECTION [PARAMS] [format: FORMAT] [raw: RAW?])
Simple for-each, calling the (PROC data) on each row of COPY TO STDOUT data returned by QUERY, in turn, only for the purpose of its side-effects.
The starred and nonstarred version are analogous to query and query*.
[procedure] (with-output-to-copy-query THUNK CONNECTION QUERY . PARAMS)[procedure] (with-output-to-copy-query* THUNK CONNECTION QUERY [PARAMS] [format: FORMAT] [raw: RAW?])
Call THUNK with CURRENT-OUTPUT-PORT parameterized to a port that writes COPY FROM STDIN data to the database connection CONN for QUERY.
Returns the final result of THUNK.
The starred and nonstarred version are analogous to query and query*.
(import postgresql) (with-output-to-copy-query (lambda () (print "one\t1") (print "two\t2") "That's a wrap") conn "COPY my_table (textcol, numbercol) FROM STDIN") => "That's a wrap"[procedure] (call-with-output-copy-query PROC CONNECTION QUERY . PARAMS)
[procedure] (call-with-output-copy-query* PROC CONNECTION QUERY [PARAMS] [format: FORMAT] [raw: RAW?])
Like with-output-to-copy-query, except it calls PROC with one argument (the copy port) instead of parameterizing CURRENT-OUTPUT-PORT.
Low-level COPY API
This API is close to the C API. It requires you to first execute a COPY query (using the query procedure), and then you can put or get data from the connection. You cannot run other queries while the connection is in COPY state.
[procedure] (put-copy-data CONNECTION DATA)Put copy data on the CONNECTION. DATA is either a string or a bytevector/u8vector and should be in the format expected by the server.
[procedure] (put-copy-end CONNECTION [ERROR-MESSAGE])This ends the COPY process. If ERROR-MESSAGE is supplied and not #f, the data sent up till now is discarded by the server and an error message is triggered on the server. If ERROR-MESSAGE is not supplied or #f, the server will commit the copied data to the target table and succeed.
A result object is returned upon success. This result object is currently not useful.
[procedure] (get-copy-data CONNECTION [format: FORMAT])Obtain one row of COPY data from the server. The data's contents will be in the format indicated by the server. If FORMAT is 'text, it the data will be returned as a string, if it is 'binary, it will be returned as a bytevector. The user is responsible for providing the right format to match the output format of the query sent earlier.
After the last row is received, this procedure returns a result object (which can be detected by calling result? on it).
Constants
[constant] invalid-oidRepresents the numeric value of the invalid Oid. Rarely useful, except perhaps when doing low-level operations in the system catalog.
Error handling
condition: postgresql
A condition of kind (exn <subtype> postgresql) is signaled when an error occurs. The <subtype> is one of the following:
- query
- There was an error while executing a statement or query.
- parse
- Something went wrong in a parser.
- unparse
- Something went wrong in an unparser.
- i/o
- Something went wrong while trying to read from or write to the connection.
- connect
- Something went wrong during (re)connections. This includes errors during connection reset.
- bounds
- An out of bounds error happened (e.g., trying to read from a nonexistant column or row index).
- type
- Invalid type was passed by the user.
- domain
- A value was passed in an inappropriate context.
- unsupported-version
- An operation was performed which is not supported by the client library.
- internal
- A truly unexpected error occurred (unrecognised status codes, etc).
There will always be a subtype. If the condition contains a query condition, this holds a lot of extra properties which you can extract to gain more information about the cause of the error, or to display to the user.
You'll always find all of these properties in the query component of the condition, but most may have a #f value.
- severity
- One of the symbols error, fatal, panic, warning, notice, debug, info, log (unfortunately, this symbol may also be translated/localised, so you should not dispatch on them in code: use error-class and error-code for that). Always present in query type subconditions.
- error-class
- A string representing a Postgresql error class (the first two characters of error-code). Always present in query type subconditions.
- error-code
- A string representing the full Postgresql error code (including the code class prefix). See the Postgresql documentation for a description of error codes and error classes. Always present in query type subconditions.
- message-primary
- The main error message. Always present in query type subconditions. The exn message will be a combination of this plus the optional detail and hint.
- message-detail
- A secondary message with extra detail about the problem.
- message-hint
- A string with a suggestion about what to do about the problem.
- statement-position
- An integer indicating an error cursor position as an index into the original statement string. The first character has index 1, and positions are measured in characters, not bytes.
- context
- An indication of the context in which the error occurred. Presently this includes a call stack traceback of active PL functions. The trace is one entry per line, most recent first.
- source-file
- The file name of the Postgresql source-code location where the error was reported.
- source-line
- The line number in source-file where the error was reported (integer).
- source-function
- The name of the source-code function reporting the error.
- internal-query
- A string containing the source text of an "internally generated" command where the error occurred (for example when you called a PL/PGSQL function which generates a query).
- internal-position
- An integer indicating the position in internal-query where the error occurred.
- schema-name
- The name of the schema ("database") in which the error occurred. This is only available when the error is associated with a specific database object.
- table-name
- The name of the table in which the error occurred. This is only available when the error is associated with a specific table.
- column-name
- The name of the column on which the error occurred. This is only available when the error is associated with a specific column.
- datatype-name
- The name of the data type on which the error occurred. This is only available when the error is associated with a specific data type (eg, a custom domain).
- constraint-name
- The name of the constraint which was violated, if any.
Type conversion
Type information is read from the database the first time you connect to it. Result set values are either text or binary (or sql null). If they are text, they are converted to Scheme objects by type parsers, as described below. If they are binary, they will be returned as unprocessed bytevectors (which you can then convert to strings using utf8->string).
Parsers
[parameter] (default-type-parsers [ALIST])Postgres result values are always just strings, but it is possible to map these to real Scheme objects. With this parameter, you can map your own custom postgresql datatype to Scheme datatypes.
The alist is a mapping of Postgres type names (strings) to procedures accepting a string and returning a Scheme object of the desired type. To discover the appropriate name string to use for a desired Postgres type, check pg_catalog.
The parsers can also be set per connection with the TYPE-PARSERS argument of the connect procedure.
(import postgresql) (parameterize ((default-type-parsers `(("text" . ,string->symbol)))) (let ((conn (connect '((dbname . test))))) (symbol? (value-at (query conn "SELECT 'hello'::text"))))) => #t
The default parsers look like this:
`(("text" . ,identity)
("bytea" . ,bytea-parser)
("char" . ,char-parser)
("bpchar" . ,identity)
("bool" . ,bool-parser)
("int8" . ,numeric-parser)
("int4" . ,numeric-parser)
("int2" . ,numeric-parser)
("float4" . ,numeric-parser)
("float8" . ,numeric-parser)
("numeric" . ,numeric-parser)
("oid" . ,numeric-parser)
("record" . ,(make-composite-parser (circular-list identity))))
These parsers are described below. For anything where no parser is found, the value is returned verbatim (which is always a string, or a bytevector in case of binary data).
Array and composite (row) types are automatically handled; unless a type-specific parser is defined, a parser is automatically created by combining the parsers for their constituent elements.
[procedure] (update-type-parsers! CONN [TYPE-PARSERS])As described above, type information is extracted from the system catalog whenever you initiate a new connection. However, there is a bootstrapping problem when you are defining custom data types. You must first connect before you can define your custom data types. But the type parsers do not have the information for this new type yet, so you must update them.
To do this, you can call update-type-parsers!. This procedure updates all the type parsers originally associated with connection CONN. By providing the optional TYPE-PARSERS, you can override the existing type parsers for this connection with new ones, otherwise the old ones are just refreshed.
[procedure] (bool-parser STR)Returns #t if the string equals "t", #f otherwise.
[procedure] (bytea-parser STR)Returns a u8vector containing the bytes in STR, after unescaping it using unescape-bytea.
[procedure] (char-parser STR)Returns the first character in STR.
[procedure] (numeric-parser STR)Returns STR converted to a number using decimal representation. If STR could not be converted to a number, raises an error.
[procedure] (make-array-parser SUBPARSER [DELIMITER])Returns a procedure that can be used to parse arrays containing elements that SUBPARSER parses. It will split the elements using the DELIMITER character, which defaults to #\,.
For example, to create a parser for arrays of integers, use (make-array-parser numeric-parser).
[procedure] (make-composite-parser SUBPARSERS)Returns a procedure that can be used to parse composite values (aka "row values"). It will use the list of SUBPARSERS to parse each element in the row by looking up the parser at the matching position in the list. For example, to create a parser for rows containing an integer and a boolean, use (make-composite-parser (list numeric-parser bool-parser)).
Unparsers
[parameter] (default-type-unparsers [ALIST])Just as PostgreSQL types are converted to Scheme types in result sets, Scheme types need to be converted to PostgreSQL types when providing positional parameters to queries. For this, the library uses type unparsers. Just like type parsers, you can override them either per-connection using the TYPE-UNPARSERS parameter to the connect procedure, or globally by changing a parameter.
This alist is a mapping of predicates to unparsers. Predicates are procedures which accept a scheme object and return a true value if the object is of the type for which the unparser is intended. Unparsers are procedures which accept two arguments; the connection object and the scheme object to unparse. Unparsers return either a string, a bytevector or an sql-null object to be used in the query.
It is not necessary to reload type unparsers after defining a new data type in the database.
Order matters; the type unparser alist is traversed from left to right, trying predicates in order and invoking the unparser linked to the first predicate that does not return #f. If none of the predicates match, the type must be of string, bytevector or sql-null type. If not, the query procedure will raise an error.
The default unparsers look like this:
`((,string? . ,(lambda (conn s) s))
(,u8vector? . ,(lambda (conn v) v))
(,char? . ,(lambda (conn c) (string c)))
(,boolean? . ,bool-unparser)
(,number? . ,(lambda (conn n) (number->string n)))
(,vector? . ,vector-unparser)
(,pair? . ,list-unparser))
[procedure] (update-type-unparsers! CONN TYPE-UNPARSERS)
Similar to update-type-parsers!, this procedure allows you to update all the type unparsers originally associated with connection CONN.
[procedure] (bool-unparser CONN B)Returns "TRUE" for true values and "FALSE" for #f.
[procedure] (vector-unparser CONN V)Returns a string representing an array containing the objects in the vector V. The elements of V are unparsed recursively by their respective subparsers. It is the responsibility of the program to use correct values for an array; the elements should all be of the same type and, if they are vectors themselves, all vectors should have the same length and recursive vector depth. Otherwise, you will get an error from postgresql.
[procedure] (list-unparser CONN L)Returns a string representing a composite object (aka row value) containing the objects in the list L. The elements of L are unparsed recursively by their respective subparsers.
LISTEN/NOTIFY support
PostgreSQL supports asynchronous notifications through LISTEN and NOTIFY.
When you want to receive these notifications, you can simply issue a LISTEN on the desired channel via SQL. Then, when the notification arrives, the connection's notify handler will be invoked. The handler can be set when initialising the connection.
[parameter] (default-notify-handler [HANDLER])The handler that will be used when setting up new connections, if none was supplied to connect. This must be a procedure of three arguments, or #f if you don't want to handle notifications.
The three arguments are: A channel name (string), the PID of the backend which triggered the NOTIFY (integer) and the notification payload (string).
[procedure] (set-notify-handler! CONN HANDLER)When you have an existing connection CONN, this procedure can update its NOTIFY handler to a new value.
HANDLER must be a procedure of three arguments, or #f if you don't want to handle notifications. The three arguments are: A channel name (string), the PID of the backend which triggered the NOTIFY (integer) and the notification payload (string).
[procedure] (wait-for-notifications! CONN [DELAY])Normally, asynchronous notifications will be piggy-backed on responses after you issue a command to the backend. But sometimes you just want to idle until a notification arrives. You can use wait-for-notifications! to do this. It will wait until either DELAY (in milliseconds) has passed, or an event arrives on the connection. If DELAY is #f (or not supplied), it will wait forever, until an event arrives.
Changelog
- 5.0.0 Port to CHICKEN 6
- 4.1.5 Fix tests for newer server versions which don't support WITH(OIDS=true) and changed pg_types system catalog
- 4.1.4 Fix tests for newer server versions which don't report a "micro" version number (reported by Kon Lovett)
- 4.1.3 Fix build script, which missed a shebang line (pointed out by Ricardo G. Herdt)
- 4.1.2 Fix dependency on scheduler unit so it doesn't crash on unbound variables when srfi-18 is not loaded.
- 4.1.1 Forgot to expose default-notify-handler
- 4.1.0 Add LISTEN/NOTIFY support.
- 4.0.0 Port to CHICKEN 5.
- 3.9.4 If pkg-config doesn't know about libpq, don't use it.
- 3.9.3 Use pkg-config when available, for increased cross-compilability (thanks to Mario Goulart)
- 3.9.2 Bugfix for strange mistake in 3.9.1
- 3.9.1 Fix escape-string so it doesn't return strings with trailing whitespace when there's nothing to escape.
- 3.9 Remove deprecated properties on postgresql condition. Add support for diagnostic schema info on errors. Add connected? procedure.
- 3.8 Rip out bogus test for reset-connection, thanks to "Herr". Improve error handling. Deprecate properties on postgresql condition (they're now properties on query, instead). Several performance improvements. unescape-bytea now returns blobs, escape-bytea accepts blobs, strings and srfi-4 vectors.
- 3.7.2 Again some tweaks for MacOS X build, thanks to Kon Lovett.
- 3.7.1 Fix installation on MacOS (with latest XCode?). Thanks to Kon Lovett for reporting this.
- 3.7 Add quote-identifier and use PQconnectStartParams where available, add basic support for executing prepared statements. Add transaction isolation and access mode keywords to with-transaction.
- 3.6.3 Restore compatibility with PostgreSQL 8.3 and older versions [Thanks to Mario Goulart].
- 3.6.2 Fix tests for PostgreSQL 9.0, prefer one null column over zero columns when faced with ambiguous composite value output syntax "()".
- 3.6.1 Fix a small problem in the way composite values are parsed so that trailing NULL values are properly deserialized.
- 3.6 Fix problem with built-in "name" type which got falsely interpreted as an array (and possibly others as well) [Reported by David Krentzlin]. Fix foreign type signature for escape-bytea internal function so it correctly accepts bytea values containing NUL bytes (Chicken 4.6.0 and later).
- 3.5.1 Small change in type information query so it works in Postgres 8.2 (and possibly older versions). [Thanks to Alaric Snell-Pym and Florian Zumbiehl]
- 3.5 Add support for composite and array types. Change semantics of 'raw' option to query* procedure. Backwards incompatible change; pass connection to all type unparsers.
- 3.4.1 Improve COPY interface
- 3.4 Add "raw" option to row-alist for consistency. Fix library flags in setup script [Thanks to Felix]. Add COPY support. Remove multi-query.
- 3.3 - Fix connection over TCP/IP problem
- 3.2 - Add transaction support.
- 3.1 - Small (but backwards incompatible) interface improvement: make row and column arguments optional.
- 3.0 - Port to Chicken 4. Major code overhaul, backwards incompatible.
- 2.0.14 - Add PQescapeStringConn functionality to pg:escape-string. Fixed weird compilation issue with SRFI-69 by removing it from USES list.
- 2.0.12 - added pg:named-tuples parameter [Contributed by Graham Fawcett]
- 2.0.11 - added syntax-case requirements to .meta file [Thanks to Michele Simionato]
- 2.0.10 - adapted to new easyffi usage [Thanks to rreal]
- 2.0.9 - Yet more improvements to error reporting
- 2.0.8 - More detailed error information, export pg:sql-null-object
- 2.0.7 - Added missing error-function [Thanks to Reed Sheridan]
- 2.0.6 - Removed dependency on format [Thanks to Reed Sheridan]
- 2.0.5 - Some bugfixes and pq:escape-string by Reed Sheridan; adapted to SRFI-69 hash-tables
- 2.0.4 - Changed usage of hygienic macros in setup script
- 2.0.3 - Bugfixes.
- 2.0.0 - Interface improvements. (Backward-incompatible.)
- 1.2.1 - Non-blocking queries.
- 1.2.0 - Optimizations, minor fixes and cleanups.
License
Copyright (C) 2008-2024 Peter Bex Copyright (C) 2004 Johannes Grødem <johs@copyleft.no> Redistribution and use in source and binary forms, with or without modification, is permitted. THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.