Dear all, It is my pleasure to announce the release of RumbleDB 1.16 "Shagbark Hickory" beta. RumbleDB takes XQuery's little cousin, JSONiq, to the Petabyte level on large scale clusters -- but also feels right at home on your 8-core laptop where it seamlessly spreads the processing of your disk data over all your cores. RumbleDB has also become a Swiss army knife with which you can easily convert your datasets between all kinds of formats (JSON, Parquet...) as well as clean up, structure, normalize, validate your data and feed it right into Machine Learning pipelines (Rumble ML) -- all within the JSONiq language. As always, it remains free and open source on http://www.rumbledb.org and is especially useful in a teaching context as we now have several universities using them in their DB courses. There is a public sandbox accessible in this notebook: https://colab.research.google.com/github/RumbleDB/rumble/blob/master/RumbleSandbox.ipynb We will also present RumbleDB at Declarative Amsterdam this week. New features include the following (many thanks to our users who asked for some of them -- this release was very user-centric): - there is a new --query parameter to enter your query directly on the command line (no need to save it to a file any more) - there is a new --shell-filter parameter for modifying the way the output is shown if you work in shell mode (e.g. --shell-filter 'jq . -S -C' for pretty printing with jq) - there are new output formats: TYSON (www.tyson-spec.org) for labeling each value (also nested) with its type, json with stricter semantics (top-level strings are quoted) in addition to the other formats (parquet, csv, etc). The default remains json-xml-hybrid which doesn't quote top-level strings, similar to what happens with XQuery. - there is a new JSound (www.jsound-spec.org) validator page in the RumbleDB server, and there is one publicly available at http://public.rumbledb.org:9090/jsound-validator.html - you can define user-defined atomic types with the JSound verbose syntax: the builtin atomic types and facets are the same as XML Schema. - in particular all builtin types (except XML-specific ones like NMTOKEN, IDREFS, etc) are now supported: newly supported are byte, dateTimeStamp, gDay, gMonth, gYear, gYearMonth, gMonthDay, int, long, negativeInteger, nonNegativeInteger, positiveInteger, nonPositiveInteger, unsignedInt, unsignedLong, unsignedByte, unsignedShort, short, - you can define user-defined array types via the JSound verbose syntax, including subtypes - you can define user-defined, open object types via the JSound verbose syntax (they are, of course, not implemented as DataFrames, but this makes no difference at the syntactic level except they cannot be used with ML estimators and transformers) - plenty of new builtin functions supported: fn:min#2, fn:max#2, fn:unordered#1, fn:distinct-values#2, fn:index-of#3, fn:deep-equal#3, fn:string#0, fn:string#1, fn:substring-before#3, fn:substring-after#3, fn:string-length#0, fn:resolve-uri#1, fn:resolve-uri#2, fn:ends-width#3, fn:starts-width#3, fn:contains#3, , fn:normalize-space#0, fn:default-collation#0, fn:number#0, fn:implicit-timezone#0, fn:not#0, fn:static-base-uri#1, fn:dateTime#2, fn:false#0, fn:true#0 - fixes in the serialization of floats and doubles to be more W3C-conformant - many bug fixes (we now use the W3C XQuery test suite actively to follow the standard as close as we can and hunt for bugs) Kind regards, Ghislain
Attachment:
smime.p7s
Description: application/pkcs7-signature