Changes in version 0.4-0 (2026-02-19) o chunk.tapply() has been fully re-written and it takes advantage key-aware chunk reader. The sep= argument is ignored if the input is a chunk reader and defaults to "\t" otherwise. o The implementation now works both with and without growabale vector API and across a wide range of R versions. o remove superfluous rawToChar() on column names (#38) Changes in version 0.3-5 (2023-12-02) o add format casts Changes in version 0.3-4 o pass max.size argument through chunk.map() (#39) o minor change to work around rchk not being able to follow protections across functions. Changes in version 0.3-3 o fix error/segfault (depending on R version) in as.output() when a type that doesn't support LENGTH() is passed (such as NULL). o CH.MAX.SIZE was ignored in chunk.apply() for parallel jobs o add CH.BINARY flag which can be set to TRUE if the merge step should be performed continually as a call to a binary CH.MERGE function instead of collecting all results and then calling CH.MERGE. Analogously, CH.INITIAL has been added which is a function called on the first result. If NULL then CH.MERGE(NULL, result) is called instead. Note: in previous versions regular chunk.apply() was behaving like CH.BINARY=FALSE, but when parallel was set then it behaved like CH.BINARY=TRUE. Now CH.BINARY is explicit. o new parallel chunk.apply() implementation The related arguments have been re-named to avoid clashes with actual function arguments. CH.MERGE now behaves the same way as with sequential processing for consistency. CH.PARALLEL - if set to 2 or higher triggers parallel processing of chunks CH.SEQUENTIAL - if FALSE then parallel processing is allowed to change the order of the chunks to process chunks yield results faster frist. Changes in version 0.3-2 (2021-07-23) o minor changes for compatibility with write-barrier and R-devel (no functional difference) Changes in version 0.3-1 (2020-03-09) o make sure connections are closed in examples so check doesn't complain o add PROTECT() to chunk.apply() and string singletons Changes in version 0.3-0 o integers incorrectly parsed empty strings to 0 instead of NA (#27) o add as.output.raw() which supports both direct file descriptors and connections o Extend the handling of as.output() as.output() now supports three modes: 1) con=NULL: a raw vector is created 2) con=connection: writes output to binary connection 3) con=iotools.stderr/stdout/fd(fd): writes directly to a file descriptor Also as.output() is now pass-through for raw vectors. Finally, most methods now support keys to be either a logical value to suppress names/row names or it can also be a character vector in which case its content is used as keys. Changes in version 0.2-6 o add support for logical vectors in fdrbind Changes in version 0.2-5 (2018-01-25) o disable non-blocking raw fd reads on Windows since select() does NOT work on FDs there. Changes in version 0.2-4 o remove unnecessary reference to stdout o increase tmeporary buffer to (hopefully) appease gcc7 o add stdout_writeBin C code o add fdrbind() Changes in version 0.2-3 o fix a bug in timeout parameter of read.chunk() where subsecod timeouts were computed incorrectly Changes in version 0.2-2 o add support for raw file descriptors and timeout in the chunk reader Changes in version 0.2-1 o use R_GetConnection() API in R >=3.3.0 o add chunk.map to mimic hmr locally o fix col.names handing in write.csv.raw() (#26) o clean up as_output_matrix to be 64-bit safe o use internal C methods for all output support ragged lists (with recycling) and long vectors in as_output_dataframe o support I() to tag ojebcts that don't want to use as.character() o make string coersion rules consistent o re-factor as.output.data.frame to use dybuf o support binary connection con in as.output() instead of buffering o add support for quoting via quote= parameter (#25) Changes in version 0.1-12 (2015-07-31) o don't import parallel::mc* since it doesn't exist on Windows Changes in version 0.1-11 o fix issues, mostly convert to 64-bit Changes in version 0.1-10 o remove old stdio API o add quoting to read.csv.raw o support quotes in character fields (#24) Changes in version 0.1-9 o fix handing of Windows line endings (#23) Changes in version 0.1-8 o add support for iterators - imstrsplit/idstrsplit (Thanks to Mike Kane! - #19) o add tests and fixes to make them run on edge cases o fix mstrsplit when given length zero input o re-factor as.output() to use dynamic buffers Changes in version 0.1-7 o add C implementations of as.output() Changes in version 0.1-6 o support tab/comma separated files with as.output() when x is a data.frame or matrix o make loading hmr silently the default until we rename hmr and go to CRAN o fix header=TRUE bug o treat NAs in dstrsplit list input as a way to skip columns Changes in version 0.1-5 o Removed "pipeline" parameter for chunk.apply and updated the documentation o Parallel option added to chunk.apply() o major re-structuring of the raw parsers (dstrsplit and mstrsplit) previous versions included code for Hadoop Map/Reduce, that code has now been moved to a separate package: https://github.com/s-u/hmr Changes in version 0.1-4 o support names from colspec, support list colspec o add experimental remote submission capability Changes in version 0.1-3 o add hadoop.opt option and hadoop.conf support Changes in version 0.1-2 o fix missing PROTECT in chunk.tapply Changes in version 0.1-1 o add key-awareness when splitting o add ctapply() - more efficient implementation of tapply() for contiguous keys o add support for Hadoop 2.x Changes in version 0.1-0 o initial public release