Commit graph

82 commits

Author SHA1 Message Date
Matthew Wild
f19f1088b7 mod_http (and dependent modules): Make CORS opt-in by default (fixes #1731)
The same-origin policy enforced by browsers is a security measure that should
only be turned off when it is safe to do so. It is safe to do so in Prosody's
default modules, but people may load third-party modules that are unsafe.

Therefore we have flipped the default, so that modules must explicitly opt in
to having CORS headers added on their requests.
2022-03-28 14:53:24 +01:00
Kim Alvefur
ca19260145 mod_http_file_share: Use alternate syntax for filename in Content-Disposition
The Lua string.format %q doesn't behave correctly for all characters
that should be escaped in a quoted-string. And who knows what effects
higher Unicode might have here.

Applying percent-encoding of filenames seems like the safest way to deal
with filenames, as well as being easier than implementing the actual
quoted-string transform, which seems complicated and I'm not even sure
it covers every possible character.

Filenames can safely be assumed to be UTF-8 since they are passed in an
attribute in the query without any escaping.
2022-01-29 16:11:38 +01:00
Kim Alvefur
ec9ec1b9a0 mod_http_file_share: Always measure total disk usage for statistics!
Metrics available or not depending on configuration is weird, even tho
it might be expensive to calculate and it's only really needed when
there is a global quota.

Default quota is set to infinity, which is essentially what it was.

Reports NaN if there is an error, which should count as over the
infinite default quota.
2022-01-11 04:15:29 +01:00
Kim Alvefur
8bef874d8f mod_http_file_share: Fix to take retention time into account
It was lost in 6f4790b8deec when switching to mod_cron.
2021-12-04 16:48:39 +01:00
Kim Alvefur
ce345d2908 mod_http_file_share: Improve consistency of terminology in logging
Prefer 'prune' over 'delete' since it more strongly implies removal of
excess.
2021-12-04 15:03:26 +01:00
Kim Alvefur
7a1198b727 mod_http_file_share: Rename variable for clarity 2021-12-04 15:03:52 +01:00
Kim Alvefur
3e7177a533 mod_http_file_share: Fix deletion counter
Before aa60f4353001 each loop had its own counter, seems incrementing of
one of them was lost. But only one is needed anyhow.
2021-12-04 14:28:04 +01:00
Kim Alvefur
d4f8f23818 mod_http_file_share: Back out 876e1b6d6ae4 2021-12-03 10:43:02 +01:00
Kim Alvefur
9950712ba3 mod_http_file_share: Recalculate total storage usage weekly instead of daily
Before mod_cron this job ran less frequently than the upload removal
job. Running them at the same frequency seems wasteful somehow, plus the
total should not drift away from the true value that fast.
2021-12-03 09:08:23 +01:00
Kim Alvefur
62ff00cd1f mod_http_file_share: Keep track of total storage use across restarts
The value needs to be known in order to determine if additional uploads
can be accepted.
2021-12-03 08:21:29 +01:00
Kim Alvefur
a3d8a25bdd mod_http_file_share: Fix measuring total storage use before it was known
Passing nil to the metrics system causes errors later.
2021-12-03 08:16:18 +01:00
Kim Alvefur
82f6358657 mod_http_file_share: Keep global storage use accurate longer.
Merging those loops removes the miscounting that would occur in case a
file could not be deleted, so no need to limit it to that case.
2021-11-30 00:55:54 +01:00
Kim Alvefur
d2f4a57bfc mod_http_file_share: Merge file expiry loops
Not sure what the benefit of two separate loops was, perhaps reduced
memory usage by allowing archive query state to be garbage collected
before moving on to deleting files. Never measured so probably not so.

This simplifies a bit.
2021-11-30 00:53:22 +01:00
Kim Alvefur
4836354fd0 mod_http_file_share: Limit query to time since last expiry
This is probably a bad idea, as files that could not be deleted for some
reason will not be tried again. I was just thinking of what the 'task'
argument could be used for.
2021-11-30 00:52:07 +01:00
Kim Alvefur
a70a8c4ffc mod_http_file_share: Switch to mod_cron for periodic tasks 2021-11-22 14:54:32 +01:00
Kim Alvefur
7a272b1670 mod_http_file_share: Move number coercion into util.dataforms 2021-10-25 21:45:06 +02:00
Kim Alvefur
092875aaec mod_http_file_share: Silence luacheck warning 2021-10-23 13:32:01 +02:00
Kim Alvefur
9335877898 mod_http_file_share: Clean up incomplete uploads
If the request fails in the middle then the file~ could be left behind
because no code was invoked to delete it then.  This gets rid of it when
the request is removed. It may still be left in case of an unclean
shutdown.
2021-10-23 01:53:07 +02:00
Kim Alvefur
aed2ada774 mod_http_file_share: Allow 'Authorization' header via CORS (thanks kawaii)
Can't find anything saying anything on whether this is needed or not.
kawaii reported that both Chrome and Firefox complained unless the
header was added to the list of allowed headers.
2021-10-20 23:06:07 +02:00
Jonas Schäfer
3e55057a85 mod_http_file_share: return 401 instead of 403 if authentication failed
This is as per the HTTP standards [1]. Thankfully, the REQUIRED
www-authenticate header is already generated by the code.

   [1]: https://datatracker.ietf.org/doc/html/rfc7235#section-3.1
2021-10-19 16:37:32 +02:00
Kim Alvefur
72e0ff88ba mod_http_file_share: Fix measuring how long periodic task take 2021-09-14 15:33:39 +02:00
Kim Alvefur
9af7bb723b mod_http_file_share: Measure current total usage
In order to allow monitoring. Especially as there's not much in the way
of hard numbers on how much space gets used.
2021-09-12 17:03:02 +02:00
Kim Alvefur
5392ef1dd6 mod_http_file_share: Fix traceback in global quota debug logging (thanks Martin)
Error in util.human.units.format because of B(nil) when the global quota
is unset.
2021-09-12 11:47:22 +02:00
Kim Alvefur
ae7f6c34f0 mod_http_file_share: Add optional global quota on total storage usage
Before, maximum storage usage (assuming all users upload as much as they
could) would depend on the quota, retention period and number of users.
Since number of users can vary, this makes it hard to know how much
storage will be needed.

Adding a limit to the total overall storage use solves this, making it
simple to set it to some number based on what storage is actually
available.

Summary job run less often than the prune job since it touches the
entire archive; and started before the prune job since it's needed
before the first upload.
2021-09-12 01:38:33 +02:00
Kim Alvefur
1904899421 mod_http_file_share: Update comment about x-frame-options
X-Frame-Options was replaced by the Content-Security-Policy
'frame-ancestors' directive, but Internet Explorer does not support that
part of CSP.

Since it's just one line it doesn't hurt to keep until some future
spring cleaning event :)
2021-06-20 16:30:24 +02:00
Kim Alvefur
2f7da2c6ea mod_http_file_share: Build list of measuring buckets for configured size limit
Creates buckets up to the configured size limit or 1TB, whichever is
smaller, e.g. {1K, 4K, 16K, ... 4M, 16M}
2021-06-08 13:33:40 +02:00
Kim Alvefur
a854ec1e4c mod_http_file_share: Handle out of bounds Range request
Turns out you can seek past the end of the file without getting an
error.

Also rejects empty range instead of sending the whole file.
2021-05-17 14:14:25 +02:00
Kim Alvefur
f5757cae7f mod_http_file_share: Support download resumption via Range requests
Only a starting point is supported due to the way response:send_file()
sends everything it gets from the provided file handle but does not have
any way to specify how much to read.

This matches what Conversations appears to be doing.
2021-05-16 16:52:59 +02:00
Kim Alvefur
ecd6b31eea mod_http_file_share: Include expiry time of the upload itself in token
Lets an external upload service know this so it can do expiry itself.
Could possibly have been calculated based on the token expiry or
issuance time, explicit > implicit.
2021-04-05 17:21:18 +02:00
Kim Alvefur
e590c9c92e mod_http_file_share: Include time of issuance in auth token
In case an external upload service wants to have the original creation
time, or calculate the token expiry itself.
2021-04-05 17:16:18 +02:00
Kim Alvefur
bb06c42b07 mod_http_file_share: Group related properties for readability 2021-04-05 17:15:12 +02:00
Kim Alvefur
f7131f81f2 mod_http_file_share: Log error opening file for writing
util.error.coerce() doesn't work well with iolib
2021-04-05 16:24:39 +02:00
Kim Alvefur
de641f6e24 mod_http_file_share: Fix logging of error opening file
It's annoying that Lua interpolates the filename into the error message.
2021-04-05 16:23:59 +02:00
Kim Alvefur
d8613bb7b9 mod_http_file_share: Include storage in prune time measurement
Maybe the original idea was that you would measure storage separately?
2021-03-31 09:43:05 +02:00
Kim Alvefur
29a56e7747 mod_http_file_share: Add internal command to check files consistency
Background: Found a few files in my store that did not match the size
recorded in the slot, so I needed a way to check which which those were.
As it was a bit too much to type into the shell I added it here instead.
2021-03-28 15:57:15 +02:00
Kim Alvefur
1be4a5c297 mod_http_file_share: Fix reporting of missing files
This just gave an unhelpful 500 error.

It would be nice to have some wrapper code that could untangle the
embedded filename in the io libs errors.
2021-03-28 13:51:06 +02:00
Kim Alvefur
9631d45fcc mod_http_file_share: Report number of items in caches to statsmanager
This is neat, O(1) reporting, why don't we do this everywhere?
Gives you an idea of how much stuff is in the caches, which may help
inform decisions on whether the size is appropriate.
2021-03-28 13:15:11 +02:00
Kim Alvefur
dfaf43cc3f mod_http_file_share: Skip removal of nothing
In case none of the expired files could be deleted then it's a waste of
an API call to try to remove any of the metadata at all.
2021-02-26 00:20:24 +01:00
Kim Alvefur
a2648b9d64 mod_http_file_share: Remove correct entries when not all expired files were deleted
If any of the expired files could not be deleted then we should not
forget about that, we should complain loudly and try again.

The code got this backwards and would have removed only the entries
referring to still existing files.

Test procedure:
1.  Upload a file
2.  chown root:root http_file_share/
3.  In uploads.list, decrease 'when' enough to ensure expiry
4.  Reload mod_http_file_share
5.  Should see an error in the logs about failure to delete the file
6.  Should see that the metadata in uploads.list is still there
7.  chown http_file_share/ back to the previous owner
8.  Reload mod_http_file_share
9.  Should see logs about successful removal of expired file
10. Should see that the metadata in uploads.list is gone
11. Should see that the file was deleted
2021-02-25 23:58:08 +01:00
Kim Alvefur
0a5ac437b9 mod_http_file_share: Fix traceback on missing file-type
attempt to index a nil value (local 'filetype') casued by the :gsub call
2021-02-24 16:35:56 +01:00
Kim Alvefur
b01915e81c mod_http_file_share: Allow credentials via CORS (needed for auth token) 2021-02-23 02:56:49 +01:00
Kim Alvefur
900fe6a1cb mod_http_file_share: Return a message from the base URL 2021-02-21 06:49:11 +01:00
Kim Alvefur
3151984042 mod_http_file_share: Prevent attempt to upload again after completion 2021-02-13 14:14:12 +01:00
Kim Alvefur
9501c7b976 mod_http_file_share: Support sending 100 Continue
E.g. curl will ask for this when sending large uploads. Removes a delay
while it waits for an error or go-agead.
2021-02-13 14:06:46 +01:00
Kim Alvefur
cdbb77ccbf mod_http_file_share: Collect cache hit/miss statistics for downloads 2021-02-02 22:16:20 +01:00
Kim Alvefur
e45dee37bd mod_http_file_share: Measure how long it takes to prune expired files 2021-02-02 22:11:53 +01:00
Kim Alvefur
da5cd9c963 mod_http_file_share: Collect statistics of files uploaded 2021-02-02 22:08:51 +01:00
Kim Alvefur
5d9802727b mod_http_file_share: Reorder arguments
'filetype' is optional, so having it last seems sensible.
'slot' is pretty important, so moving it earlier seems sensible.
2021-01-31 20:38:40 +01:00
Kim Alvefur
f5baa83714 mod_http_file_share: Update cached value while it is reasonably fresh
This should ensure that cache entries until the oldest file that counted
to the last 24h becomes older than 24h.
2021-01-31 17:44:19 +01:00
Kim Alvefur
864708efb7 mod_http_file_share: Cache quotas to avoid hitting storage 2021-01-31 17:56:49 +01:00