Ralph Corderoy [Mon, 15 May 2017 13:12:21 +0000 (14:12 +0100)]
Specify function parameters in prototypes, mainly void.
These prototypes are all in *.[cl] source compared with the *.h of the
earlier commit. All but one had void added, that one's parameter was a
char pointer.
Ralph Corderoy [Sun, 14 May 2017 18:35:13 +0000 (19:35 +0100)]
sbr/dtimep.l: Remove redundant wrapping parenthesis in definitions.
It's been decades since a lex(1) didn't follow POSIX and treat `{foo}'
as parenthesis surrounding foo's definition. Other nmh lex files work
without the extra wrapping.
If the profile component nmh-storage was a single character, e.g. the
relative directory `d', then it would be dropped from the output path so
instead of `d/foo' being written, `/foo' would probably fail. I think
this was due to a faulty check for nmh-storage being `/'. Add a test
for a non-/ single-character nmh-storage.
Ralph Corderoy [Sun, 14 May 2017 10:23:33 +0000 (11:23 +0100)]
Use new PLURALS(n) macro instead of variety of tests.
Whether to output a plural noun in a message was decided by ternary
operators that tested n==1, n!=1, n>1, etc. Make them all consistent by
using PLURALS(n).
Ralph Corderoy [Sat, 13 May 2017 17:39:06 +0000 (18:39 +0100)]
uip/folder.c: Rewrite plural test to common form.
The common form is a choice between "" and "s". This one occurrence was
using " " and "s" so either output was a fixed width. Alter the
printf(3) format string from `%s' to `%1s' to provide that instead.
Allows the new code to be altered by an upcoming `plural' change.
Ralph Corderoy [Sat, 13 May 2017 17:35:15 +0000 (18:35 +0100)]
Makefile.am: Alter long lists to be sorted, one entry per line.
It was hard to see what files were included in some variables when
searching for the best place for new content. Some variables had
definitions that were almost sorted, by weren't. Switch to a
one-entry-per-line list, in `LC_ALL=C sort' order, documented at the
start of the file. Makes it easy to run through the lists and see the
patterns.
The space padding was added at the start of the output buffer, not the
start of the component being formatted. Caused by 92128dac's move to
dynamic allocation for fmt_scan()'s output. Only shows if the component
isn't at the start of the buffer. Expand existing
test/format/test-rightjustify to cover this.
Ralph Corderoy [Tue, 9 May 2017 22:10:06 +0000 (23:10 +0100)]
config/version.sh: Rewrite. Use uname(1), git-describe(1), and UTC.
Not sure why it searched through PATH manually for uname(1) and
hostname(1). uname and its -n option are POSIX so just use those. It
was the preference over hostname anyway.
Use git-describe(1), not just git-branch(1), as it gives more detail,
including --dirty to show the built source differs from the commit.
Specify the format for the build date, and its timezone; +0000.
Ralph Corderoy [Sun, 7 May 2017 17:02:03 +0000 (18:02 +0100)]
uip/picksbr.c: Increase line-buffer size for "grep" action.
Alter LBSIZE from 1024 to NMH_BUFSIZ, e.g. 8192. This dominates the
size of the char array used to hold a single unfolded header line when
evaluating a "grep" action, e.g. `-to foo' becomes `^to[ ^I]*:.*foo'.
Ralph Corderoy [Sun, 7 May 2017 13:01:48 +0000 (14:01 +0100)]
uip/scansbr.c: Replace uses of SBUFSIZ with NMH_BUFSIZ.
Now that the former is defined as the latter, the uses of SBUFSIZ were
all in the same expression and it simplifies from a ternary expression
to NMH_BUFSIZ.
Ralph Corderoy [Sun, 7 May 2017 12:57:39 +0000 (13:57 +0100)]
uip/rcvdist.c: Replace SBUFSIZ with NMH_BUFSIZ.
The former was defined as the latter and had only one proper use: to
size a char array. The other uses should have been the sizeof operator
on that array, and now are.
Ralph Corderoy [Sun, 7 May 2017 12:03:01 +0000 (13:03 +0100)]
Make many m_getfld() buffer parameters NMH_BUFSIZ big.
char arrays were often stdio.h's BUFSIZ large, and that's 8192 with here
with glibc 2.25-1 on Linux x86_64, so using NMH_BUFSIZ is no different
as that's max(BUFSIZ, 8192). But some were 256 or 512 with a local
SBUFSIZ macro and they caused scan(1) to truncate a field, or pick(1) to
not spot text because the field was truncated.
Delete bad-input/test-header's test for m_getfld()'s detection for a
header without a colon that's under the header-length NAMESZ limit, but
longer than the passed in buffer. This no longer happens in scan as the
buffer is larger than the longest allowed header.
Ralph Corderoy [Fri, 5 May 2017 12:11:15 +0000 (13:11 +0100)]
configure.ac: Enable assert(3) by default.
Revert 167e542b that disabled assert(3) by default after private email
discussion with David Levine. I suggested it being enabled by default
in git so those running nmh from git were testing them for us, with it
being disabled as part of the release process so packagers continue with
it disabled. David pointed out that would make the release
intentionally different from what had been long tested so the default is
now enabled assertions everywhere unless disabled by a packager.
Ralph Corderoy [Fri, 28 Apr 2017 21:08:26 +0000 (22:08 +0100)]
sbr/oauth.c: Remove const to avoid curl's debug_callback warning.
The curl_debug_callback typedef doesn't use const for any of the
function's parameters. On one of the compilation platforms here,
oauth.c's debug_callback having const specifiers for some parameters
causes compilation warnings, and -Werror stops the compilation. Remove
them.
Ralph Corderoy [Fri, 28 Apr 2017 16:12:31 +0000 (17:12 +0100)]
sbr/folder_read.c: Don't malloc() once per message.
Instead of struct msgs having a pointer to a malloc'd array of pointers,
each to a malloc'd struct bvector, 1+N, it now has a pointer to a
malloc'd array of struct bvector; one malloc for all of them. This
avoids the large number of calls to malloc() and free() that's linear
with the size of the folder.
But there are some downsides. In order to step through an array of
struct bvector, code outside of sbr/vector.c needs to know the struct's
size. The simplest way to do this is to make the struct's definition
public, with a comment that access should be through vector.c.
New functions are needed to initialise the content of an already
allocated bvector, and to finish with its content prior to deallocation.
bvector_create() and bvector_free() now also use these new functions.
Before, it was the array of pointers to bvector that would be realloc'd.
That doesn't work for the array of bvectors as they may contain pointers
to within themselves. The solution is to malloc a new array and
bvector_copy() the ones to keep across, as folder_realloc() now does.
The other half of its logic that coped with growth at the end of the
array, has been deleted. Also deleted, is the code to clear the
bvectors before and after the old ones as they start in that state.
Ralph Corderoy [Fri, 28 Apr 2017 11:45:26 +0000 (12:45 +0100)]
sbr/vector.c: Only allocate bvector storage for set bits.
Now that a pointer to all the bit storage doesn't escape vector.c,
there's no need to allocate storage, that's initialised to 0, just to
then clear a bit in it. Only extend the bits's storage for set bits.
Remove the unused initial size parameter from bvector_create() as it
simplifies the implementation.
Ralph Corderoy [Wed, 26 Apr 2017 23:14:28 +0000 (00:14 +0100)]
sbr/vector.c: Zero the growth with memset(3), not loop.
When the resize functions grow the vectors, as they always do, zero the
new slots with memset(3) rather than a for loop. Particularly of note
for the bit vector where it was bvector_clear()ing one bit at a time.
Although a NULL pointer needn't have a representation of all-zero bits,
the code was already assuming that, e.g. on the initial allocation, so
we're no worse off.
Ralph Corderoy [Wed, 26 Apr 2017 12:41:27 +0000 (13:41 +0100)]
sbr/vector.c: Change bvector_bits() to return first word.
Rename it to bvector_first_bits() to represent its new behaviour. It
has only one caller that uses it to produce debug. Tighten the API so
other callers don't get access to the location of the bvector's bits.
Ralph Corderoy [Tue, 25 Apr 2017 22:21:21 +0000 (23:21 +0100)]
sbr/vector.c: Embed initial vector storage in header struct.
Instead of a malloc(3)'d struct bvector having a pointer to a separately
malloc'd area for the bits in the normal case of the default initial
size, have non-malloc'd storage in the struct itself, and set the
pointer to that. It's two unsigned longs, which is less than the
previous default of 256 bits for struct bvector, but still double the
pre-bvector norm of one word on 32 and 64-bit architectures.
This halves the mallocs needed to create a struct bvector in the common
case, but does mean that embedded memory is wasted should it not be
enough. That's probably an unusual case. It also means derefencing the
pointer to the bits probably hits the same cache line.
Have separate initial sizes for string and int vectors.
Ralph Corderoy [Mon, 24 Apr 2017 23:20:08 +0000 (00:20 +0100)]
sbr/vector.c: Remove `vec' argument from BVEC_OFFSET(), etc.
A few macros took a struct bvector pointer as their first argument just
to get the sizeof one of its fields. This can be done with a NULL
pointer so remove that argument from all of them.
Ralph Corderoy [Mon, 24 Apr 2017 22:47:46 +0000 (23:47 +0100)]
sbr/vector.c: Delete unused bvector_maxsize() and ivector_size().
May as well keep the proffered interface as small as possible so the
implementation can make greater assumptions knowing some internal
details aren't available.
Ralph Corderoy [Mon, 24 Apr 2017 21:58:56 +0000 (22:58 +0100)]
sbr/vector.c: Move assert(3)s into bvector_create().
Rather than asserting on every bit-related operation, assert just when
creating the vector. Whether the assertion is true is decided at
compile time, so it only needs checking once; once for every vector is
an easy compromise.
Ralph Corderoy [Mon, 24 Apr 2017 20:19:56 +0000 (21:19 +0100)]
sbr/vector.c: calloc(3) rather than malloc(3) and memset(3).
calloc(3) tells libc upfront that the memory needs to be zeroed rather
than giving it the news later with memset. Perhaps this allows it to
allocate from a CoW page of zero bytes, or it benefits from some other
way in memset not having to clear each bit, but it knocks about 15% off
the wall-clock time and the number of library calls under ltrace(1).
Ralph Corderoy [Tue, 25 Apr 2017 23:14:26 +0000 (00:14 +0100)]
sbr/fmt_scan.c: Only wcwidth(3) a valid mbtowc(3) result.
The assert(3) added by 80a9e99f7078199500d2d53c8d77d1b92af06fbc is
failing, but not reproducibly. It's probable that mbtowc() is returning
a negative, and not altering wide_char, leaving it as random stack
content. Taking its wcwidth() then sometimes also returns negative,
causing the assert() failure. Initialising wide_char before the call
isn't a solution as it isn't documented if it's modified to an invalid
value on an error return.
Instead, delay calculating the wcwidth() until after the possible
substitution of "?". Leave the assert() in place.
Code that's conditional on its definition has been kept; just the tests
removed. The comment explaining its purpose has been kept, just without
the "If defined...".
Ralph Corderoy [Sun, 23 Apr 2017 13:36:57 +0000 (14:36 +0100)]
mh.h: Remove unused MODIFIED and DELETED macros.
MODIFIED was a folder attribute used by msh(1), and DELETED a message
attribute that may have last been used in the 1980s.
Adjust corresponding FBITS and MBITS definitions.
Ralph Corderoy [Sun, 23 Apr 2017 12:33:57 +0000 (13:33 +0100)]
tws.h: Remove zero-valued TW_SNIL macro, used once.
Macro TW_SNIL was the zero value for a two-bit field. It was only used
once, and that is better ordered to check for bits set with none set
being the last, else, case, rather than in the middle. Kept the
behaviour of undefined value 3 being treated as TW_SIMP, 2.
Ralph Corderoy [Sun, 23 Apr 2017 12:18:20 +0000 (13:18 +0100)]
tws.h: Remove redundant TW_SZONE and TW_SZNIL bit-masks.
TW_SZONE is a mask of one bit that can either be 0, TW_SZNIL, or set,
TW_SZEXP. Rather than three macros for the single bit, have one,
TW_SZEXP. Simplifies the test too when the mask is known to be one-bit
wide.
Two of the column headings were separated by spaces rather than a tab.
The other two similar tables used `Return' instead of `Result', so
switch to that at the same time.
Ralph Corderoy [Sun, 23 Apr 2017 08:48:13 +0000 (09:48 +0100)]
sbr/dtime.c: Remove struct-assigning twscopy().
Historically, twscopy() assigned all the struct's members individually,
but that was `#if 0'd as far back as the repository goes. The one
caller now uses assignment, making clear the direction of data flow.
Ralph Corderoy [Sun, 23 Apr 2017 08:28:24 +0000 (09:28 +0100)]
uip/sortm.c: Remove unused variable and dlocaltimenow() call.
read_hdrs() has a auto `struct tws tb'. It's only use is as the
destination for dlocaltimenow()'s result. Remove the variable. Also
remove the dlocaltimenow() call; this I'm unsure about as it has a side
effect of calling tzset(3). However, tests pass. Repository history
sheds no light.
Ralph Corderoy [Sat, 22 Apr 2017 22:36:19 +0000 (23:36 +0100)]
Correct first-line comments where filename is wrong.
Having jiggled the start-of-file comment about, it's apparent that a few
files either don't know their own name, or the format the rest of the
gang follow. To wit, "basename(filename) -- ...".
Ralph Corderoy [Sat, 22 Apr 2017 22:19:39 +0000 (23:19 +0100)]
Replace "/*\n * " with "/* " at the start of a file.
Pulls the meat of the comment onto the file's first line, now that the
blank line at the start of the file has been deleted. Allows `sed -ns
1p ...' or similar to give a summary of lots of files.
Ralph Corderoy [Sat, 22 Apr 2017 18:05:32 +0000 (19:05 +0100)]
Remove leading and trailing blank lines from code.
Many of the main source file start with a blank line, and many end with
one or more too. They seem to serve little purpose so delete them.
This makes all the files consistent in not having them and ideally the
first line of a C source file will say something about its content in
time instead of just "/*". Avoided editing format files, etc,. where
the line could be significant, and historical files.
Ralph Corderoy [Sat, 22 Apr 2017 17:32:55 +0000 (18:32 +0100)]
mts/smtp/smtp.h: Remove unused `MMDF' macros.
The header file has a specification of a historical interface and
removing the unused parts leaves it incomplete, but it's unlikely those
parts will be needed and what remains seems more understandable; partly
due to it being less to grok, and also because it relates more to modern
use.
isfrom(s) checked for /^>?From /.
lequal(a, b) was another case-insensitive strcmp(3).
mfgets(fp, line) fetched a header at a time from fp, indicating when the
end of headers was reached.
Investigating their historical usage was, ironically, made too tedious
to complete by the pollution of mass commits of historical MH.