Merge "Create spec for optional subunit data import"

2017-09-05 18:33:17 +00:00 · 2017-09-05 18:33:17 +00:00 · fe2eca2ea6
commit fe2eca2ea6
parent 84be8b1766 2832d3a56f
1 changed files with 315 additions and 0 deletions
--- a/specs/pike/approved/upload-subunit-tests.rst
+++ b/specs/pike/approved/upload-subunit-tests.rst
@ -0,0 +1,315 @@
 =============================================
 Upload Subunit Data From Test Results
 =============================================
 Launchpad blueprint:
 * https://blueprints.launchpad.net/refstack/+spec/subunit-data-upload
 This specification describes an expansion of the RefStack API's
 functionality to allow for upload of the subunit data which corresponds
 to a given set of test results.
 Problem description
 ===================
 Currently, all test results uploaded to the RefStack website consists
 of a json file containing only the portion of the RefStack run pertaining
 to the passed tests. This limitation dates back to the the start of the
 RefStack project. At that time, Defcore (which is now known as interop-WG)
 was very concerned about the possibility that private data may be included
 in the subunit upload file. Defcore was concerned that vendors might, for
 that reason, be hesistant to upload data into RefStack for fear of
 unintentionally revealing vendor-specific data such as reasons for test
 failures.  For this reason, Defcore agreed unanimously that RefStack should
 care only about passing tests, and not failed or skipped ones.
 The risk, with this resolution, however, is that not including a full set of
 results means that it would be fairly simple to falsify those results in
 order to make an OpenStack instance appear to be more interoperable than
 it actually is. This too, was discussed at the time, and Defcore eventually
 arrived at the conclusion that, in the end, it would be better to accept
 vendor results in good faith, but to always leave the door open for users
 and Foundation staff to verify those results independently. This decision
 did not, however, account for the possibility that vendors seeking support
 during the process of verifying the interoperability of their product may
 need a way to securely share subunit data for review by Foundation staff.
 Proposed change
 ===============
 In order to move towards having a more reliable and verifiable collection
 of RefStack results, we intend to add functionality to the RefStack
 toolkit that will allow for the upload of the subunit data related to a given
 set of test results. This data would be private, only accessible to the party
 uploading it, and to foundation staff, to be used for result integrity
 verification as well as debugging assistance. Upload of subunit data will not,
 for the time being, be required.
 After discussing a number of data storage methods at the 7/18/17 RefStack
 meeting[3], we settled upon saving the newly usable subunit data in the
 current database. With a small adjustment to our alembic settings (which
 will keep the version tables from colliding), this could be done using the
 existing subunit2sql toolkit[4]. In order to apply the table name change,
 we will build in a series of functions that check refstack.conf and rename
 the existing alembic version table if needed. This added functionality,
 when merged and functional, will make RefStack one of only two OpenStack
 projects (according to oslo.conf docs[7])that is currently capable of
 modifying configuration at runtime without a service restart. The usage of
 subunit2sql will do a lot of the heavy lifting for us, as far as data import
 goes, as well as keeping the storage method of test data consistent across
 the board.
 For the time being, we plan to link the subunit data will be linked to the
 corresponding test results via a key value pair in the metadata table that
 is an existing part of the RefStack database.
 Toolset to use:
 subunit2sql
 Alternatives
 ------------
 Though we did eventually decide upon storing the new data in a new, separate
 database, a few alternate options were discussed during the 7/18/17 RefStack
 meeting[3]. The alternate options discussed were:
 * Save subunit files as-is in a file system. This has the benefit of being the
  least processing-intensive option for saving the data, as it would literally
  just save the output into a file. It may, however, make subunit data upload
  a bit less elegant, as well as being a deviation from the way test run data
  is managed throughout the rest of RefStack.
 * Save subunit data in the RefStack database and tables by building in the
  functionality required to save and manage it. Like the option listed above,
  this option keeps test run data stored consistently across refstack, which
  would make the changes to the API more consistent as well. It would also
  avoid the overhead that would result from using a separate database, as well
  as any redundancies that resulted from using a second, separate database.
  However, any redundancy would be fairly minor due to the extremely limited
  scope of the data we are currently storing from each test run, and this would
  leave more of the implementation up to us, which, because of how well
  subunit2sql's schema fulfills the needs of this change, may be wholly
  unnecessary.
 * Save subunit data in a separate database created by subunit2sql. This has the
  benefit of having all of the functionality we need without forcing us to
  reinvent the wheel, but it also carries with it the overhead of having to use
  a second database. This option doesn't make much sense, however, given that
  we can actually use subunit2sql's toolkit in the current refstack database,
  as long as we can configure the database to use an extra (differently named)
  alembic version table for refstack's core db.
 Data model impact
 -----------------
 We may be able to use the tables created by subunit2sql within the RefStack
 database. These tables (for reference) are mapped out below:
 --------------------------------------
 |               tests                |
 --------------------------------------
 |   id           |  String(256)      |
 |   test_id      |  String(256)      |
 |   run_count    |  Integer          |
 |   failure      |  Integer          |
 |   run_time     |  Float            |
 --------------------------------------
 ----------------------------------------
 |              runs                    |
 ----------------------------------------
 |  id            |  BigInteger         |
 |  skips         |  Integer            |
 |  fails         |  Integer            |
 |  passes        |  Integer            |
 |  run_time      |  Float              |
 |  artifacts     |  Text               |
 |  run_at        |  DateTime           |
 ----------------------------------------
 ---------------------------------------------------
 |                    test_runs                    |
 ---------------------------------------------------
 |  id                      |  BigInteger          |
 |  test_id                 |  BigInteger          |
 |  run_id                  |  BigInteger          |
 |  status                  |  String(256)         |
 |  start_time              |  DateTime            |
 |  start_time_microseconds |  Integer             |
 |  stop_time               |  DateTime            |
 |  stop_time_microseconds  |  Integer             |
 |  test                    |  Test                |
 |  run                     |  Run                 |
 ---------------------------------------------------
 -------------------------------------------
 |            run_metadata                 |
 -------------------------------------------
 |  id            |  BigInteger            |
 |  key           |  String(255)           |
 |  value         |  String(255)           |
 |  run_id        |  BigInteger            |
 |  run           |  Run                   |
 -------------------------------------------
 -------------------------------------------
 |          test_run_metadata              |
 -------------------------------------------
 |  id            |  BigInteger            |
 |  key           |  String(255)           |
 |  value         |  String(255)           |
 |  test_run_id   |  BigInteger            |
 |  test_run      |  TestRun               |
 -------------------------------------------
 -------------------------------------------
 |            test_metadata                |
 -------------------------------------------
 |  id            |  BigInteger            |
 |  key           |  String(255)           |
 |  value         |  String(255)           |
 |  test_id       |  BigInteger            |
 |  test          |  Test                  |
 -------------------------------------------
 -------------------------------------------
 |            attachments                  |
 -------------------------------------------
 |  id            |  BigInteger            |
 |  test_run_id   |  BigInteger            |
 |  label         |  String(255)           |
 |  attachment    |  LargeBinary           |
 |  test_run      |  TestRun               |
 -------------------------------------------
 more details about this data model can be found in the source docs for
 subunit2sql[5]
 If we end up being unable to integrate the two databases into one at this time,
 we plan to use the metadata table which already exists in the RefStack internal
 db to store a key pair that links the existing test data to the newly added
 subunit data.
 REST API impact
 ---------------
 We will need to implement a new REST API for the  upload of subunit data
 from the client, and then use subunit2sql to process and save the data
 into the database.
 Security impact
 ---------------
 It has been suggested that uploading the subunit data for tests may expose
 private data. However, it was determined in the 6/27/2017 RefStack meeting[1]
 that if any such data is revealed through this upload, it would be due to a
 leak in tempest's logging procedures, not the upload of this new type of data.
 This was also discussed at the 6/28/17 Interop-wg meeting[2]. It was at this
 meeting that was confirmed that we would implement this change using an
 opt-in flag, so that those who are still concerned about the security of
 uploading the results do not, by default, have to upload their data. It was
 also determined that, due to the fact that this design reflects a fairly
 significant reversal in a past decision, that the community should be
 properly notified. This decision also resulted in the following action plan:
 1. write an email to distribute to the mailing list
 2. send out the official decision after the email is distributed
 3. change the offical interop docs to reflect this change
 Another concern was that a database injection attack may be possible, if an
 attacker were to use maliciously crafted subunit data. This threat, also,
 does not appear to be much of a danger, as the mass majority of the data
 written to the database is done after the subunit data is processed, meaning
 that there are very few places in which raw strings are written into the db.
 We need to look a little  more into whether sql does enough input sanitization
 for our needs.
 Notifications impact
 --------------------
 None
 Other end user impact
 ---------------------
 None
 Performance impact
 -------------------
 None
 Other deployer impact
 ---------------------
 We will also need to adjust refstack-client to be able to consume the new API
 feature while uploading subunit data.
 One of the most user-visible part of this change would be the creation of a
 flag option which enables the upload of the subunit data to the refstack site,
 which would modify the existing procedure in that we would need to build in
 functionality that would allow for the additional data upload.
 We would also need to add a second flag to the database sync functionality in
 order to allow for the alternate naming of the alembic version table, which
 enables us to use both subunit2sql and refstack tables and functionality
 within the same database.
 Developer impact
 ----------------
 None
 Implementation
 ==============
 Assignees(s)
 ------------
 Primary assignee:
  Megan Guiney
 Other contributors:
  Paul Van Eck (subunit data upload ui in refstack-client)
 Work Items
 ----------
 * Add a CONF option to allow for the usage of nonstandard alembic
  version table names.
 * Add a utility that allows for the runtime checking and alteration
  of alembic version table names.
 * Create an API at the server side to accept the subunit data
 * At the server side, use subunit2sql to process the subunit data
 * Link subunit data to existing set of refstack results.
 * Create UI to upload subunit data (completed, as of 1/20/2016[6],
  though may require update)
 * Create a UI to display subunit data. There may already be one, but
  we need to make sure such a utility exists. We also need to decide
  whether the results should be viewable via the refstack website.
 Dependencies
 ============
 Testing
 =======
 Documentation Impact
 ====================
 We will need to update the docs to reflect the additions to the API, the
 database, and to refstack-client as well.
 References
 ==========
 [1] http://eavesdrop.openstack.org/meetings/refstack/2017/refstack.
    2017-06-27-19.00.log.html
 [2] http://eavesdrop.openstack.org/meetings/interopwg/2017/interopwg.
    2017-06-28-16.00.log.html
 [3] http://eavesdrop.openstack.org/meetings/refstack/2017/refstack.
    2017-07-18-19.00.log.html
 [4] https://git.openstack.org/cgit/openstack-infra/subunit2sql
 [5] https://docs.openstack.org/subunit2sql/latest/data_model.html
 [6] https://review.openstack.org/#/c/265394/
 [7] https://docs.openstack.org/oslo.config/latest/configuration/
    mutable.html