swift

Author	SHA1	Message	Date
Peter Portante	5202b0e586	DiskFile API, with reference implementation Refactor on-disk knowledge out of the object server by pushing the async update pickle creation to the new DiskFileManager class (name is not the best, so suggestions welcome), along with the REPLICATOR method logic. We also move the mount checking and thread pool storage to the new ondisk.Devices object, which then also becomes the new home of the audit_location_generator method. For the object server, a new setup() method is now called at the end of the controller's construction, and the _diskfile() method has been renamed to get_diskfile(), to allow implementation specific behavior. We then hide the need for the REST API layer to know how and where quarantining needs to be performed. There are now two places it is checked internally, on open() where we verify the content-length, name, and x-timestamp metadata, and in the reader on close where the etag metadata is checked if the entire file was read. We add a reader class to allow implementations to isolate the WSGI handling code for that specific environment (it is used no-where else in the REST APIs). This simplifies the caller's code to just use a "with" statement once open to avoid multiple points where close needs to be called. For a full historical comparison, including the usage patterns see: https://gist.github.com/portante/5488238 (as of master, 2b639f5, Merge "Fix 500 from account-quota This Commit middleware") --------------------------------+------------------------------------ DiskFileManager(conf) Methods: .pickle_async_update() .get_diskfile() .get_hashes() Attributes: .devices .logger .disk_chunk_size .keep_cache_size .bytes_per_sync DiskFile(a,c,o,keep_data_fp=) DiskFile(a,c,o) Methods: Methods: .__iter__() .close(verify_file=) .is_deleted() .is_expired() .quarantine() .get_data_file_size() .open() .read_metadata() .create() .create() .write_metadata() .delete() .delete() Attributes: Attributes: .quarantined_dir .keep_cache .metadata DiskFileReader() Methods: .__iter__() .close() Attributes: +.was_quarantined DiskWriter() DiskFileWriter() Methods: Methods: .write() .write() .put() .put() * Note that the DiskFile class * Note that the DiskReader() object implements all the methods returned by the necessary for a WSGI app DiskFileOpened.reader() method iterator implements all the methods necessary for a WSGI app iterator + Note that if the auditor is refactored to not use the DiskFile class, see https://review.openstack.org/44787 then we don't need the was_quarantined attribute A reference "in-memory" object server implementation of a backend DiskFile class in swift/obj/mem_server.py and swift/obj/mem_diskfile.py. One can also reference https://github.com/portante/gluster-swift/commits/diskfile for the proposed integration with the gluster-swift code based on these changes. Change-Id: I44e153fdb405a5743e9c05349008f94136764916 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-10-17 15:03:31 -04:00
ZhiQiang Fan	f72704fc82	Change OpenStack LLC to Foundation Change-Id: I7c3df47c31759dbeb3105f8883e2688ada848d58 Closes-bug: #1214176	2013-09-20 01:02:31 +08:00
Chmouel Boudjnah	d3bd30da41	Fix sync test when localhost on port 80 is binded - When localhost:80 was binding the tests was trying to connect into it. - To test you can simply run sudo python -m SimpleHTTPServer 80 which should show : 1.0.0.127.in-addr.arpa - - [06/Aug/2013 14:10:42] code 501, message Unsupported method ('DELETE') 1.0.0.127.in-addr.arpa - - [06/Aug/2013 14:10:42] "DELETE /a/c/o HTTP/1.1" 501 - (the test was passing since 501 would raise ClientException). mock delete_object in the fourth test to fix that - Refactor the code to use mock.patch as well. Closes-Bug: 1208802 Change-Id: I5ddd4ac3a97879f51cf5883fcfc0fe0f0adaeff6	2013-09-11 15:49:48 -04:00
Dirk Mueller	3d36a76156	Use Python 3.x compatible except construct except x,y: was deprected and is removed in Python 3.x. Use "except x as y:" instead which works in any Python version >= 2.6. Change-Id: I7008c74b807340f3457d3a0c8bd0b83f23169d14	2013-09-07 10:50:54 +02:00
Peter Portante	1d9213bed4	Pep8 container test_sync and test_diskfile (9 of 12) Change-Id: I469e3b75b6846fddb5e9d2b8317506129836d0a6 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-09-01 16:12:51 -04:00
Peter Portante	9480ff8a28	Refactor auditors to rely on expected gen names Refactor the various auditors to rely on the audit_location_generator yielding tuples containing paths with the expected suffix. We also fix the exception handling for container_sync to not expect a broker object (since the act of creating a broker object can raise an exception). For the object auditor we removed an unneeded check for disk_file since get_data_file_size() will raise DiskFileNotExist under the same condition (raises code coverage slightly). Change-Id: I11d405e629063177ef21543b75e9076da1a03b61	2013-07-15 17:29:42 -04:00
gholt	0a77f04893	Added utils.FileLikeIter A really simple version of this was in container sync already, and I needed a more complete version for work I'm doing, and I noticed https://review.openstack.org/#/c/33405/ was also making use of it. So, here's a more full version. If https://review.openstack.org/#/c/33405/ lands before this, I'll update it accordingly. Change-Id: Iba66b6a97f65e312e04fdba273e8f4ad1d3e1594	2013-06-19 18:21:52 +00:00
David Hadas	a979c8007b	Add support for Hash Prefix A new configuration parameter is added to /etc/swift/swift.conf [swift-hash] swift_hash_path_prefix = 'random unique string' New installations are advised to set this parameter to a random secret, which would not be disclosed ouside the organization. The same secret needs to be used by all swift servers of the same cluster. Existing installations should set this parameter to an empty string (the default) DocImpact Fixes: Bug #1157454 Change-Id: I63b10d0b7d6dd3f74e0f10bb41b5f240fa03578a	2013-03-22 19:41:55 +02:00
David Hadas	8b140033f0	Improved container-sync resiliency container-sync now skips faulty objects in the first and second rounds. All replicas try in the second round. No server will give up until the faulty object suceeds Fixes: bug #1068423 Change-Id: I0defc174b2ce3796a6acf410a2d2eae138e8193d	2013-03-08 22:28:06 +02:00
Joe Gordon	45f0502b52	Fix spelling mistakes git ls-files \| misspellings -f - Source: https://github.com/lyda/misspell-check Change-Id: I4132e6a276e44e2a8985238358533d315ee8d9c4	2013-02-12 16:39:40 -08:00
Dan Hersam	923b8b95c3	Fix for container sync not syncing last rowid Bug 1079439 Change-Id: I1985a7176f34931ecb0f24c0289f18df5e934598	2012-11-19 23:09:17 +00:00
Donagh McCabe	00c3fde8f8	Handle down object servers in container-sync If an object server is down, container-sync stops syncing the container even if the it gets object copies from "up" obejct servers. Bug 1069910 In case the git history gets mangled, this fix was done almost entirely by Donagh McCabe <donagh.mccabe@hp.com>. Change-Id: Ieeadcfeb4e880fe5f08e284d7c12492bf7a29460	2012-11-01 16:45:35 -07:00
Chmouel Boudjnah	22572b506a	Remove swift.common.client from here. - It has been to its own gerrit project. - direct_client should follow next. - Implements blueprint clientbindings. Change-Id: I3bb50c95eba81302bfec71cb7ce5288b85a41dc0	2012-06-07 16:36:49 +02:00
Darrell Bishop	3d3ed34f44	Adding StatsD logging to Swift. Documentation, including a list of metrics reported and their semantics, is in the Admin Guide in a new section, "Reporting Metrics to StatsD". An optional "metric prefix" may be configured which will be prepended to every metric name sent to StatsD. Here is the rationale for doing a deep integration like this versus only sending metrics to StatsD in middleware. It's the only way to report some internal activities of Swift in a real-time manner. So to have one way of reporting to StatsD and one place/style of configuration, even some things (like, say, timing of PUT requests into the proxy-server) which could be logged via middleware are consistently logged the same way (deep integration via the logger delegate methods). When log_statsd_host is configured, get_logger() injects a swift.common.utils.StatsdClient object into the logger as logger.statsd_client. Then a set of delegate methods on LogAdapter either pass through to the StatsdClient object or become no-ops. This allows StatsD logging to look like: self.logger.increment('some.metric.here') and do the right thing in all cases and with no messy conditional logic. I wanted to use the pystatsd module for the StatsD client, but the version on PyPi is lagging the git repo (and is missing both the prefix functionality and timing_since() method). So I wrote my swift.common.utils.StatsdClient. The interface is the same as pystatsd.Client, but the code was written from scratch. It's pretty simple, and the tests I added cover it. This also frees Swift from an optional dependency on the pystatsd module, making this feature easier to enable. There's test coverage for the new code and all existing tests continue to pass. Refactored out _one_audit_pass() method in swift/account/auditor.py and swift/container/auditor.py. Fixed some misc. PEP8 violations. Misc test cleanups and refactorings (particularly the way "fake logging" is handled). Change-Id: Ie968a9ae8771f59ee7591e2ae11999c44bfe33b2	2012-05-11 15:25:38 -07:00
Samuel Merritt	bb509dd863	As-unique-as-possible partition replica placement. This commit introduces a new algorithm for assigning partition replicas to devices. Basically, the ring builder organizes the devices into tiers (first zone, then IP/port, then device ID). When placing a replica, the ring builder looks for the emptiest device (biggest parts_wanted) in the furthest-away tier. In the case where zone-count >= replica-count, the new algorithm will give the same results as the one it replaces. Thus, no migration is needed. In the case where zone-count < replica-count, the new algorithm behaves differently from the old algorithm. The new algorithm will distribute things evenly at each tier so that the replication is as high-quality as possible, given the circumstances. The old algorithm would just crash, so again, no migration is needed. Handoffs have also been updated to use the new algorithm. When generating handoff nodes, first the ring looks for nodes in other zones, then other ips/ports, then any other drive. The first handoff nodes (the ones in other zones) will be the same as before; this commit just extends the list of handoff nodes. The proxy server and replicators have been altered to avoid looking at the ring's replica count directly. Previously, with a replica count of C, RingData.get_nodes() and RingData.get_part_nodes() would return lists of length C, so some other code used the replica count when it needed the number of nodes. If two of a partition's replicas are on the same device (e.g. with 3 replicas, 2 devices), then that assumption is no longer true. Fortunately, all the proxy server and replicators really needed was the number of nodes returned, which they already had. (Bonus: now the only code that mentions replica_count directly is in the ring and the ring builder.) Change-Id: Iba2929edfc6ece89791890d0635d4763d821a3aa	2012-05-09 15:56:06 -07:00
John Dickinson	1ecf5ebba1	updated copyright date for all files Change-Id: Ifd909d3561c2647770a7e0caa3cd91acd1b4f298	2012-03-19 13:45:34 -05:00
gholt	f9fb4c739c	consync: Now queries all primary nodes for a put and uses the newest object if it is newer or equal to the object to sync	2011-06-16 01:31:51 +00:00
gholt	78d417dda7	consync: More tests and slight refactor to be more testable	2011-06-16 00:59:55 +00:00
gholt	6587fd914e	consync: Some more tests and bugfixes.	2011-06-15 02:01:01 +00:00

19 Commits