Differences between revisions 20 and 21
Revision 20 as of 2013-06-05 22:01:44
Size: 6336
Comment: out-of-date tip, official docs link
Revision 21 as of 2013-06-05 22:15:39
Size: 6335
Comment: fix docs link
Deletions are marked like this. Additions are marked like this.
Line 5: Line 5:
'''NOTE''': A much better way to do this is via the new HTTP Proxying feature. See the [[http://docs.couchdb.org/en/latest/configuring.html#http-proxying||official documentation]] for that topic. '''NOTE''': A much better way to do this is via the new HTTP Proxying feature. See the [[http://docs.couchdb.org/en/latest/configuring.html#http-proxying|official documentation]] for that topic.

We have a new wiki. The migration is not 100% complete. You can help out by moving pages across. This wiki will exist for as long as there are pages left.

The official documentation has moved to http://docs.couchdb.org — The transition is not 100% complete, but http://docs.couchdb.org should be seen as having the latest info. In some cases, the wiki still has some more or older info on certain topics inside CouchDB.

You need to be added to the ContributorsGroup to edit the wiki. But don't worry! Just email any Mailing List or grab us on IRC and let us know your user name.

External Processes

NOTE: A much better way to do this is via the new HTTP Proxying feature. See the official documentation for that topic.

CouchDB now allows for the ability to develop custom behaviors via processes that communicate over stdin and stdout. Requests to CouchDB that are captured by the external process handler are passed via JSON object to the external process over stdin and reads a JSON object from stdout. Without further ado...

JSON Requests

Requests capture information about the incoming HTTP request and transform it into a JSON object. I've formatted the object here, though in real life this object would contain no new lines and all embedded white space would be normalized to a single ' ' (space) character.

An example object:

   1 {
   2     'body': 'undefined',
   3     'cookie': {
   4         '__utma': '96992031.3087658685658095000.1224404084.1226129950.1226169567.5',
   5         '__utmz': '96992031.1224404084.1.1.utmcsr'
   6     },
   7     'form': {},
   8     'info': {
   9         'compact_running': False,
  10         'db_name': 'couchbox',
  11         'disk_size': 50559251,
  12         'doc_count': 9706,
  13         'doc_del_count': 0,
  14         'purge_seq': 0,
  15         'update_seq': 9706},
  16     'path': [],
  17     'query': {},
  18     'method': 'GET'
  19 }

In order:

  • body - Raw post body

  • cookie - Cookie information passed on from mochiweb

  • form - If the request's Content-Type is "application/x-www-form-urlencoded", a decoded version of the body

  • info - Same structure as returned by http://127.0.0.1:5984/db_name/

  • path - Any extra path information after routing to the external process

  • query - Decoded version of the query string parameters.

  • method - HTTP request verb

  • userCtx - Information about the User

Note: Before CouchDB 0.11 method was verb.

JSON Response

The response object has five possible elements

  • code - HTTP response code [Default is 200]. Note that this must be a number and cannot be a string (no "").

  • headers - An object with key-value pairs that specify HTTP headers to send to the client.

  • json - An arbitrary JSON object to send the client. Automatically sets the Content-Type header to "application/json"

  • body - An arbitrary CLOB to be sent to the client. Content-Type header defaults to "text/html"

  • base64 - Arbitrary binary data for the response body, base64-encoded

While nothing breaks if you specify both a json and body member, it is undefined which response will be used. If you specify a Content-Type header in the headers member, it will override the default.

Common Pitfalls

  • When responding to queries always remember to turn off buffering for stdout or issue a flush() call on the file handle.

  • All interaction is in the form of single lines. Each response should include *exactly* one new line that terminates the JSON object.
  • When using base64 encoders, be sure to strip any CRLF from the result - most encoders will add CRLF after 76 characters and at the end.
  • CouchDB 0.10 looks for a case-sensitive match of the Content-Type header -- a user-defined header must specify "Content-Type", not "content-type" or "CoNtEnT-type". This is fixed in future releases.
  • When developing handlers you need to restart CouchDB after each change as it doesn't see the changes until you restart the server.
  • Notes for OSX users: If you are using launchctl to load and unload your CouchDB instance, it by default starts the couchdb instance with user couchdb. External handlers won't run if they are not executable by this user. Calls to them will fail with a 'OS Process timeout' error stack trace from Erlang. If you start couchdb with sudo couchdb things will work fine. Here is an article in more detail http://www.vertigrated.com/blog/2010/04/couchdb-launchctl-external-httpd-handlers-madness/

Configuration

Adding external processes is as easy as pie. Simply place key=command pairs in the [external] section of your local.ini and then map those handlers in the [httpd_db_handlers] section, like:

;Including [log] and [update_notification] for context

[log]
level = info

[external]
test = python /usr/local/src/couchdb/test.py

[httpd_db_handlers]
_test = {couch_httpd_external, handle_external_req, <<"test">>}

[update_notification]
;unique notifier name=/full/path/to/exe -with "cmd line arg"

This configuration will make the /usr/local/src/couchdb/test.py responsible for handling requests from the url:

http://127.0.0.1:5984/${dbname}/_test

Example External Process

Here is a complete Python external process that does a whole lot of nothing except show the mechanics.

   1 import sys
   2 
   3 try:
   4     # Python 2.6
   5     import json
   6 except:
   7     # Prior to 2.6 requires simplejson
   8     import simplejson as json
   9 
  10 def requests():
  11     # 'for line in sys.stdin' won't work here
  12     line = sys.stdin.readline()
  13     while line:
  14         yield json.loads(line)
  15         line = sys.stdin.readline()
  16 
  17 def respond(code=200, data={}, headers={}):
  18     sys.stdout.write("%s\n" % json.dumps({"code": code, "json": data, "headers": headers}))
  19     sys.stdout.flush()
  20 
  21 def main():
  22     for req in requests():
  23         respond(data={"qs": req["query"]})
  24 
  25 if __name__ == "__main__":
  26     main()

A Java example can be found here: http://daily.profeth.de/2009/12/apache-couchdb-external-process-using.html

HTTP proxying

As of November 2010, Couchdb has gained the ability to manage external OS processes and to proxy to an external HTTP server.

This lets you integrate an external HTTP app server, written in any language you like, without using the JSON protocol defined above. This means you can write extensions using regular frameworks and process binary (non-UTF8, non-JSON) data. It is intended to replace the [external] mechanism.

ExternalProcesses (last edited 2013-06-05 22:15:39 by NathanVanderWilt)