Commits · 4c93668202f8b3a02e3730e696e9ebcd6f08daca · Forest Godfrey / gitlab-ce

10 Aug, 2017 1 commit

Migrate events into a new format · 0395c471

authored 7 years ago

This commit migrates events data in such a way that push events are
stored much more efficiently. This is done by creating a shadow table
called "events_for_migration", and a table called "push_event_payloads"
which is used for storing push data of push events. The background
migration in this commit will copy events from the "events" table into
the "events_for_migration" table; push events will also have a row
created in "push_event_payloads".

This approach allows us to reclaim space in the next release by simply
swapping the "events" and "events_for_migration" tables, then dropping
the old events (now "events_for_migration") table.
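The copy-then-swap flow described above can be sketched roughly as follows. This is a minimal illustration using Python's built-in sqlite3 rather than PostgreSQL and a Rails background migration. The column sets, the `PUSHED` value, the JSON encoding of the legacy "data" column, and the helper names (`create_schema`, `migrate_batch`, `swap_tables`) are assumptions made for the example and are not taken from the commit.

```python
import json
import sqlite3

PUSHED = 5  # hypothetical numeric value of the "pushed" action


def create_schema(conn):
    conn.executescript("""
        CREATE TABLE events (
            id INTEGER PRIMARY KEY, action INTEGER, title TEXT, data TEXT
        );
        -- Shadow table: no "title" and no "data" column.
        CREATE TABLE events_for_migration (
            id INTEGER PRIMARY KEY, action INTEGER
        );
        -- The foreign key points at the shadow table, not at "events".
        CREATE TABLE push_event_payloads (
            event_id INTEGER PRIMARY KEY REFERENCES events_for_migration (id),
            ref TEXT,
            commit_count INTEGER
        );
    """)


def migrate_batch(conn, start_id, end_id):
    """Copy one ID range into the shadow table without updating "events"."""
    rows = conn.execute(
        "SELECT id, action, data FROM events WHERE id BETWEEN ? AND ?",
        (start_id, end_id),
    ).fetchall()
    for event_id, action, data in rows:
        conn.execute(
            "INSERT INTO events_for_migration (id, action) VALUES (?, ?)",
            (event_id, action),
        )
        if action == PUSHED and data:
            # Assumed legacy format: a JSON hash with these two keys.
            push = json.loads(data)
            conn.execute(
                "INSERT INTO push_event_payloads (event_id, ref, commit_count)"
                " VALUES (?, ?, ?)",
                (event_id, push.get("ref"), push.get("total_commits_count", 0)),
            )
    conn.commit()


def swap_tables(conn):
    """Next release: swap the two tables, then drop the old events table."""
    conn.executescript("""
        ALTER TABLE events RENAME TO events_old;
        ALTER TABLE events_for_migration RENAME TO events;
        ALTER TABLE events_old RENAME TO events_for_migration;
        DROP TABLE events_for_migration;
    """)
```

After `swap_tables` runs, the table named "events" is the compact one, and the space used by the old table is released by the final `DROP TABLE`.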

The new table structure is also optimised for storage space, and does
not include the unused "title" column nor the "data" column (since this
data is moved to "push_event_payloads").

== Newly Created Events

Newly created events are inserted into both "events" and
"events_for_migration", both using the exact same primary key value. The
table "push_event_payloads" in turn has a foreign key to the _shadow_
table. This removes the need for recreating and validating the foreign
key after swapping the tables. Since the shadow table also has a foreign
key to "projects.id" we also don't have to worry about orphaned rows.
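As a rough sketch of the double insert described above: the commit message does not say how the second insert is wired up, so this example simply performs both inserts in application code inside one transaction. It uses Python's sqlite3 for illustration; the column lists, the `ON DELETE CASCADE` behaviour, and the helper names `connect` and `create_event` are assumptions.

```python
import sqlite3


def connect():
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves this off by default
    conn.executescript("""
        CREATE TABLE projects (id INTEGER PRIMARY KEY);
        CREATE TABLE events (
            id INTEGER PRIMARY KEY,
            project_id INTEGER,
            action INTEGER
        );
        -- Assumed cascade: deleting a project removes its shadow rows.
        CREATE TABLE events_for_migration (
            id INTEGER PRIMARY KEY,
            project_id INTEGER REFERENCES projects (id) ON DELETE CASCADE,
            action INTEGER
        );
    """)
    return conn


def create_event(conn, project_id, action):
    """Insert into "events", then mirror the row under the same primary key."""
    with conn:  # one transaction: both rows are written, or neither is
        cur = conn.execute(
            "INSERT INTO events (project_id, action) VALUES (?, ?)",
            (project_id, action),
        )
        event_id = cur.lastrowid
        conn.execute(
            "INSERT INTO events_for_migration (id, project_id, action)"
            " VALUES (?, ?, ?)",
            (event_id, project_id, action),
        )
    return event_id
```

Because both rows share one ID, a payload row that references the shadow table stays valid once the tables are swapped.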

This approach however does require some additional storage as we're
duplicating a portion of the events data for at least 1 release. The
exact amount is hard to estimate, but for GitLab.com this is expected to
be between 10 and 20 GB at most. The background migration in this commit
deliberately does _not_ update the "events" table as doing so would put
a lot of pressure on PostgreSQL's auto vacuuming system.

== Supporting Both Old And New Events

Application code has also been adjusted to support push events using
both the old and new data formats. This is done by creating a PushEvent
class which extends the regular Event class. Using Rails' Single Table
Inheritance system we can ensure the right class is used for the right
data, which in this case is based on the value of `events.action`. To
support displaying old and new data at the same time the PushEvent class
re-defines a few methods of the Event class, falling back to their
original implementations for push events in the old format.

Once all existing events have been migrated the various push event
related methods can be removed from the Event model, and the calls to
`super` can be removed from the methods in the PushEvent model.
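The class selection and fallback described above might look roughly like this. It is a plain-Python analogy, not Rails code: the `PUSHED` value, the field names, and the `instantiate` helper are hypothetical, and Rails' Single Table Inheritance performs the class lookup itself.

```python
from dataclasses import dataclass
from typing import Optional

PUSHED = 5  # hypothetical numeric value of the "pushed" action


@dataclass
class Event:
    action: int
    data: Optional[dict] = None      # legacy push data stored on the event
    payload: Optional[dict] = None   # row from "push_event_payloads", if any

    def commits_count(self) -> int:
        # Old format: read the count from the legacy "data" hash.
        return (self.data or {}).get("total_commits_count", 0)


class PushEvent(Event):
    def commits_count(self) -> int:
        # New format first; fall back to the old implementation otherwise.
        if self.payload is not None:
            return self.payload["commit_count"]
        return super().commits_count()


def instantiate(row: dict) -> Event:
    """Pick the class from the value of "action", as the commit describes."""
    cls = PushEvent if row["action"] == PUSHED else Event
    return cls(**row)
```

Once every event has a payload row, the `super()` branch and the legacy method on `Event` become dead code, which matches the cleanup the message anticipates.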

The UI and event atom feed have also been slightly changed to better
handle this new setup; fortunately, only a few changes were necessary to
make this work.

== API Changes

The API only displays push data of events in the new format. Supporting
both formats in the API is a bit more difficult compared to the UI.
Since the old push data was not really well documented (apart from one
example that used an incorrect "action" name) I decided that supporting
both was not worth the effort, especially since events will be migrated
in a few days _and_ new events are created in the correct format.


02 Aug, 2017 1 commit
- Change all `:empty_project` to `:project` · 72a7b30c
  Robert Speicher authored 7 years ago
  
27 Jul, 2017 1 commit

Remove superfluous lib: true, type: redis, service: true, models: true,… · ddccd24c

authored 7 years ago

Remove superfluous lib: true, type: redis, service: true, models: true, services: true, no_db: true, api: true
Signed-off-by: Rémy Coutable <remy@rymai.me>


21 Jun, 2017 1 commit
- Enable Style/DotPosition Rubocop · 0430b764
  Grzegorz Bizon authored 7 years ago
  
04 May, 2017 2 commits
- Update last_repository_updated_at when a push event is created · 91b5aaf7
  Douglas Barbosa Alexandre authored 7 years ago
  
- Backport of multiple_assignees_feature [ci skip] · 387c4b2c
  Valery Sizov authored 7 years ago
  
23 Feb, 2017 2 commits
- Revert "Prefer leading style for Style/DotPosition" · 1fe7501b
  Douwe Maan authored 7 years ago
```
This reverts commit cb10b725c8929b8b4460f89c9d96c773af39ba6b.
```
- Prefer leading style for Style/DotPosition · 206953a4
  Douwe Maan authored 7 years ago
  
01 Feb, 2017 1 commit
- Add traits for the different Event types to the Event factory · e48a1755
  Robert Speicher authored 7 years ago
  
26 Jan, 2017 1 commit
- Use `:empty_project` where possible in model specs · fcb37542
  Robert Speicher authored 7 years ago
  
25 Nov, 2016 1 commit
- Removed data-user-is view code · 0ba03d7e
  Yorick Peterse authored 8 years ago
```
With events no longer being cached this is no longer needed.
```
16 Nov, 2016 1 commit
- Allow commit note to be visible if repo is visible · d47fca53
  Rémy Coutable authored 8 years ago
```
Signed-off-by: Rémy Coutable <remy@rymai.me>
```
09 Nov, 2016 1 commit

Merge branch '23403-fix-events-for-private-project-features' into 'security' · b0088b52

authored 8 years ago

Respect project visibility settings in the contributions calendar

This MR fixes a number of bugs relating to access controls and date selection of events for the contributions calendar

Closes https://gitlab.com/gitlab-org/gitlab-ce/issues/23403

See merge request !2019
Signed-off-by: Rémy Coutable <remy@rymai.me>


20 Oct, 2016 2 commits

Differentiate the expire from leave event · f488b9f7

authored 8 years ago

At the moment we cannot see whether a user left a project due to their
membership expiring or if they themselves opted to leave the project.
This adds a new event type that allows us to make this differentiation.
Note that it is not really feasible to go back and reliably fix up the
previous events. As a result, the events for previous expire removals
will remain the same; however, events of this nature going forward will be
correctly represented.


Differentiate the expire from leave event · 9124310f

authored 8 years ago


13 Oct, 2016 1 commit
- Union examples in event spec for speed up · 48015ded
  Semyon Pupkov authored 8 years ago
  
11 Oct, 2016 1 commit
- Make guests unable to view MRs · b4004488
  Valery Sizov authored 8 years ago
  
04 Oct, 2016 1 commit

Remove lease from Event#reset_project_activity · c9bcfc63

authored 8 years ago

Per GitLab.com's performance metrics this method could take up to 5
seconds of wall time to complete, while only taking 1-2 milliseconds of
CPU time. Removing the Redis lease in favour of conditional updates
allows us to work around this.

A slight drawback is that this allows for multiple threads/processes to
try and update the same row. However, only a single thread/process will
ever win since the UPDATE query uses a WHERE condition to only update
rows that were not updated in the last hour.
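A conditional update of this kind can be sketched as below, using Python's sqlite3 for illustration. The table layout, the integer epoch-second timestamps, and the function name `touch_last_activity` are assumptions and not the actual GitLab code.

```python
import sqlite3


def touch_last_activity(conn, project_id, now, min_interval=3600):
    """Bump last_activity_at unless it was already bumped within the interval.

    `now` is an epoch timestamp in seconds. Returns True when this caller
    performed the update.
    """
    cur = conn.execute(
        "UPDATE projects SET last_activity_at = ?"
        " WHERE id = ?"
        " AND (last_activity_at IS NULL OR last_activity_at < ?)",
        (now, project_id, now - min_interval),
    )
    conn.commit()
    # The WHERE clause plays the role of the lock: callers that lose the race
    # match zero rows and change nothing.
    return cur.rowcount == 1
```

The database serialises writes to the row, so no separate Redis round trip is needed; the trade-off is that every caller still sends an UPDATE statement, most of which match no rows.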

Fixes gitlab-org/gitlab-ce#22473


19 Sep, 2016 1 commit

Restrict last_activity_at updates to one per hour · 028c086f

authored 8 years ago

The lock in turn is only obtained when actually needed, reducing some
load on Redis.

Fixes gitlab-org/gitlab-ce#22213


07 Jul, 2016 1 commit
- Update test with new factory name · 70a64f6a
  Douwe Maan authored 8 years ago
  
06 Jul, 2016 1 commit
- Use a more future-proof check for Note/LegacyDiffNote · 19e15ae2
  Rémy Coutable authored 8 years ago
```
Signed-off-by: Rémy Coutable <remy@rymai.me>
```
04 Jul, 2016 1 commit
- Fix diff comments not showing up in activity feed · 4834e2e6
  Rémy Coutable authored 8 years ago
```
Signed-off-by: Rémy Coutable <remy@rymai.me>
```
16 Jun, 2016 2 commits
- Revert "squashed merge and fixed conflicts" · 452c076a
  James Lopez authored 8 years ago
```
This reverts commit 13e37a3e.
```
- squashed merge and fixed conflicts · 13e37a3e
  James Lopez authored 8 years ago
  
13 Jun, 2016 1 commit
- Project members with guest role can't access confidential issues · b56c4567
  Douglas Barbosa Alexandre authored 8 years ago
  
03 Jun, 2016 2 commits
- Revert "Fix merge conflicts - squashed commit" · 721014c9
  James Lopez authored 8 years ago
```
This reverts commit 3e991230.
```
- Fix merge conflicts - squashed commit · 3e991230
  James Lopez authored 8 years ago
```
# Conflicts:
#	app/models/project.rb
```
09 May, 2016 1 commit

Remove the annotate gem and delete old annotations · f1479b56

authored 8 years ago

In 8278b763 the default behaviour of annotation
has changed, which was causing a lot of noise in diffs. We decided in #17382
that it is better to get rid of the whole annotate gem, and instead let people
look at schema.rb for the columns in a table.

Fixes: #17382


25 Apr, 2016 1 commit
- Throttle the update of `project.last_activity_at` to 1 minute · 533cd8c2
  Rémy Coutable authored 8 years ago
```
Signed-off-by: Rémy Coutable <remy@rymai.me>
```
24 Mar, 2016 2 commits
- Rename `Event#proper?` to `Event#visible_to_user?` · 39a51f9e
  Douglas Barbosa Alexandre authored 8 years ago
  
- Comments on confidential issues doesn't show in activity feed to non-members · 23956033
  Douglas Barbosa Alexandre authored 8 years ago
  
17 Mar, 2016 1 commit
- Restrict access to confidential issues on activity feed · 9222459e
  Douglas Barbosa Alexandre authored 8 years ago
  
27 Jan, 2016 1 commit

Use Atom update times of the first event · de7c9c7a

authored 8 years ago

By simply loading the first event from the already sorted set we save
ourselves extra (slow) queries just to get the latest update timestamp.
This removes the need for Event.latest_update_time and significantly
reduces the time needed to build an Atom feed.

Fixes gitlab-org/gitlab-ce#12415


09 Dec, 2015 1 commit
- Tag model specs · 1e8d703a
  Douwe Maan authored 9 years ago
  
18 Nov, 2015 2 commits

Added Event.limit_recent · 01620dd7

authored 9 years ago

This will be used to move some querying logic from the users controller
to the Event model (where it belongs).


Faster way of obtaining latest event update time · 054f2f98

authored 9 years ago

Instead of using MAX(events.updated_at) we can simply sort the events in
descending order by the "id" column and grab the first row. In other
words, instead of this:

    SELECT max(events.updated_at) AS max_id
    FROM events
    LEFT OUTER JOIN projects   ON projects.id   = events.project_id
    LEFT OUTER JOIN namespaces ON namespaces.id = projects.namespace_id
    WHERE events.author_id IS NOT NULL
    AND events.project_id IN (13083);

we can use this:

    SELECT events.updated_at AS max_id
    FROM events
    LEFT OUTER JOIN projects   ON projects.id   = events.project_id
    LEFT OUTER JOIN namespaces ON namespaces.id = projects.namespace_id
    WHERE events.author_id IS NOT NULL
    AND events.project_id IN (13083)
    ORDER BY events.id DESC
    LIMIT 1;

This has the benefit that on PostgreSQL a backwards index scan can be
used, which due to the "LIMIT 1" will at most process only a single row.
This in turn greatly speeds up the process of grabbing the latest update
time. This can be confirmed by looking at the query plans. The first
query produces the following plan:

    Aggregate  (cost=43779.84..43779.85 rows=1 width=12) (actual time=2142.462..2142.462 rows=1 loops=1)
      ->  Index Scan using index_events_on_project_id on events  (cost=0.43..43704.69 rows=30060 width=12) (actual time=0.033..2138.086 rows=32769 loops=1)
            Index Cond: (project_id = 13083)
            Filter: (author_id IS NOT NULL)
    Planning time: 1.248 ms
    Execution time: 2142.548 ms

The second query in turn produces the following plan:

    Limit  (cost=0.43..41.65 rows=1 width=16) (actual time=1.394..1.394 rows=1 loops=1)
      ->  Index Scan Backward using events_pkey on events  (cost=0.43..1238907.96 rows=30060 width=16) (actual time=1.394..1.394 rows=1 loops=1)
            Filter: ((author_id IS NOT NULL) AND (project_id = 13083))
            Rows Removed by Filter: 2104
    Planning time: 0.166 ms
    Execution time: 1.408 ms

According to the above plans the 2nd query is around 1500 times faster.
However, re-running the first query produces timings of around 80 ms,
making the 2nd query "only" around 55 times faster.
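The two query shapes can be compared on a small in-memory table, sketched here with Python's sqlite3. The joins from the original queries are left out for brevity, and the constant and function names are hypothetical. Note that the two forms only return the same value when the row with the highest "id" also carries the latest "updated_at", which this sketch treats as an assumption of the approach.

```python
import sqlite3

# Aggregate form: has to look at every matching row.
MAX_QUERY = (
    "SELECT max(updated_at) FROM events"
    " WHERE author_id IS NOT NULL AND project_id = ?"
)

# Latest-row form: walk the primary key backwards and stop at the first match.
LATEST_ROW_QUERY = (
    "SELECT updated_at FROM events"
    " WHERE author_id IS NOT NULL AND project_id = ?"
    " ORDER BY id DESC LIMIT 1"
)


def latest_update_time(conn, project_id, query):
    """Run one of the two queries; return the timestamp, or None if no rows."""
    row = conn.execute(query, (project_id,)).fetchone()
    return row[0] if row else None
```

For append-mostly data such as events, the newest row is normally also the most recently updated one, so both forms agree in practice.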


12 Feb, 2015 1 commit
- Updated rspec to rspec 3.x syntax · 0c4a70a3
  Jeroen van Baarsen authored 9 years ago
```
Signed-off-by: Jeroen van Baarsen <jeroenvanbaarsen@gmail.com>
```
03 Nov, 2014 2 commits
- Factor '0' * 40 blank ref constants · c3be1517
  Ciro Santilli authored 10 years ago
  
- Remove dead Event#new_branch? method · c49cb40f
  Ciro Santilli authored 10 years ago
  
17 Jun, 2014 1 commit
- Fix project deletion and tests · 73f91da8
  Dmitriy Zaporozhets authored 10 years ago
  