Concourse Versions

Concourse is a container-based continuous thing-doer written in Go.

v6.7.8

2 years ago

✈️ Features

  • Optimize build log collection (#7334) @evanchaoli :link:
    • Optimized a SQL statement used to remove build logs. This optimization especially benefits large deployments that have a lot of pipelines.

🐞 Bug Fixes

🤷 Miscellaneous

  • Backport baggageclaim v1.8.0 to release/6.7.x (#7428) @taylorsilva :link:

  • Use overlay driver for worker integration test [6.7.x] (#7435) @aoldershaw :link:

📦 Bundled resource types

v7.4.0

2 years ago

✈️ Features

  • Fly clear-resource-cache command (#7003) @EstebanFS :link:

    • Added the fly clear-resource-cache command. Usage: fly -t ci clear-resource-cache -r pipeline/resource [--version some:version]
  • Build page shows name of who triggered the build in header line of build page (#7112) @evanchaoli :link:

    • The build page now shows the username of the user who triggered the build, if the build was triggered manually.
  • Add page to view all builds/resource versions downstream/upstream from a root resource version (#7125) @chenbh :link:

    • Disabled by default, since computing causality for large datasets can be expensive. Use --enable-resource-causality or $CONCOURSE_ENABLE_RESOURCE_CAUSALITY=true to enable the web UI and API endpoint.
      • Most datasets (like the merge commit for this PR) have < 100 builds and/or resource versions and take < 100ms, but it's possible for some "slow paced" resource versions (i.e. very infrequent new versions) to generate extremely large datasets.
      • There is an automatic cutoff at 5000 builds or 25000 resource versions. On our deployment, the call for our slowest paced resource took about 7 seconds to process, most of which was spent in the DB query.
    • The causality page can be navigated to from the resource page [screenshot]
    • The causality page displays all the builds and resource versions that were generated from (downstream) or resulted in (upstream) the creation of a particular resource version [screenshot]
    • The downstream graph will put the root resource version on the left whereas the upstream graph will put it on the right
    • It takes into account all the intermediate resource versions when computing the final graph. In the picture above, while the resource page only shows that git version: 123 is a direct input to integrate #4 & #5, there is also an indirect link from git version: 123 -> test #19 -> ... -> intermediate-3 version:123 -> integrate #6 & #6.1
  • Support soft policy enforcement (#7139) @evanchaoli :link:

    • This feature doesn't break the existing OPA policy check. If you have enabled the OPA policy check and don't need "soft" policy enforcement, no configuration change is required.
    • Three new ATC CLI options are added:
      • CONCOURSE_OPA_RESULT_ALLOWED_KEY: specifies the key of the allow flag in the result returned by OPA
      • CONCOURSE_OPA_RESULT_SHOULD_BLOCK_KEY: specifies the key of the should-block flag in the result returned by OPA
      • CONCOURSE_OPA_RESULT_MESSAGES_KEY: specifies the key of the messages in the result returned by OPA

    For example, if OPA returns the following result:

    {
        "result": {
            "allow": true,
            "block": true,
            "reasons": ["foo", "bar"]
        }
    }
    

    then CONCOURSE_OPA_RESULT_ALLOWED_KEY should be set to result.allow; CONCOURSE_OPA_RESULT_SHOULD_BLOCK_KEY should be result.block, and CONCOURSE_OPA_RESULT_MESSAGES_KEY should be result.reasons.

    NOTE: allow and block in the OPA result must be booleans. Other types are easy to convert to boolean inside an OPA policy.
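The key mapping above can be illustrated with a small Python sketch. It only shows how a dot-separated key such as result.allow addresses a field in the JSON document from the example. The lookup helper is hypothetical and is not Concourse's implementation, and how Concourse acts on the combination of the allow and block flags is not modelled here.

```python
import json


def lookup(document: dict, dotted_key: str):
    """Follow a dot-separated key (e.g. 'result.allow') through nested dicts.

    Hypothetical helper for illustration only.
    """
    value = document
    for part in dotted_key.split("."):
        value = value[part]
    return value


# The OPA response from the example above.
opa_response = json.loads("""
{
    "result": {
        "allow": true,
        "block": true,
        "reasons": ["foo", "bar"]
    }
}
""")

# Values the three CONCOURSE_OPA_RESULT_* options would take for this response.
allowed_key = "result.allow"
should_block_key = "result.block"
messages_key = "result.reasons"

print(lookup(opa_response, allowed_key))       # True
print(lookup(opa_response, should_block_key))  # True
print(lookup(opa_response, messages_key))      # ['foo', 'bar']
```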

  • Add ability to comment on a build (#7147) @multimac :link:

    • You can now leave comments on builds. For instance, this can be used to give context to your coworkers about why a particular build failed. [screenshot]

    • If a build has a comment, it is displayed with a small marker to help you quickly find builds of interest. Hovering over the build displays a portion of the comment. [screenshot]

  • Add teamName to concourse_steps_wait_duration metrics (#7154) @Esysc :link:

  • Use browser cache API for dashboard caching (#7247) @aoldershaw :link:

    • The cached API responses on the dashboard no longer need to be truncated. Truncation was previously introduced to work around localStorage limits.
  • Allow interpolation in the across step values (#7252) @aoldershaw :link:

    • The across step now supports dynamic interpolation of values. For instance, this can be combined with the set_pipeline step and instanced pipelines to set a dynamic list of pipelines:
      - load_var: branches
        file: branches/branches.json
      - across:
        - var: branch
          values: ((.:branches))
        set_pipeline: my-app
        file: ci/pipelines/my-app.yml
        instance_vars: {branch: ((.:branch))}
      
  • Cache the list of workers in memory (#7268) @aoldershaw :link:

    • Scheduling containers should be more performant by reducing the number of required database calls
  • Optimize build log collector (#7327) @evanchaoli :link:

    • Optimized a SQL statement used to remove build logs. This optimization especially benefits large deployments that have a lot of pipelines.
  • Enable emitting dogstatsd metrics over uds (#7338) @jmhwang7 :link:

    • The Datadog emitter can now be configured to communicate with the Datadog agent over Unix Domain Sockets

🐞 Bug Fixes

  • containerd: properly populate /etc/hosts and /etc/hostname (#7041) @muntac :link:

    • containerd: /etc/hosts and /etc/hostname are correctly populated
  • Handle 403 for vault preflight check of V2 (#7057) @xtremerui :link:

  • atc: across step logs errors (#7090) @taylorsilva :link:

    • Across step emits an error event when one of the sub-steps errors
  • containerd: Mount /dev/fuse to privileged containers (#7098) @aoldershaw :link:

  • atc(fix): fixed a bug in resource check rate limiter. (#7102) @evanchaoli :link:

    • Fixed a bug in check rate limiter that caused slow checks.
  • fix BaseResourceType for streamed volumes (#7108) @vito :link:

  • Fix worker restart issue with containerd daemon and beacon (#7113) @muntac :link:

    • Fix worker stall issue when restarting with containerd. Exit the worker's beacon process gracefully if any other top level process like the containerd daemon fails. Wait for containerd daemon to come up before starting the containerd Garden server.
  • Fix memory leak in notification bus (#7120) @aoldershaw :link:

  • containerd: default to root if /etc/passwd is missing (#7124) @aoldershaw :link:

    • Fixes a regression introduced in 7.3.0 that prevented containers that don't have an /etc/passwd file from running
  • Fix algorithm considering reruns as new builds (#7144) @taylorsilva :link:

    • Fixes pipelines getting stuck with the same inputs when a job upstream of a job with version: every succeeds and is rerun
  • containerd: keep tasks running after concourse worker restarts gracefully (#7148) @aoldershaw :link:

    • The containerd runtime is now more resilient to the concourse worker process gracefully restarting (e.g. via monit restart)
      • Tasks that were started prior to restart will continue to run when the worker process comes back up
      • This matches the behaviour of the Guardian runtime
  • Fixed build log reaper not respecting when both Days and Builds are set (#7179) @EstebanFS :link:

    • The build log reaper has two options for determining when to reap logs. Previously, if both options were set, it would reap logs when either condition was met, rather than requiring both of them to be satisfied
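A minimal Python sketch of the corrected decision, assuming the two options are a maximum age in days and a number of most recent builds to retain. The function name, parameter names, and the exact comparison operators are hypothetical and are not taken from the Concourse source.

```python
from typing import Optional


def should_reap(age_days: int, newer_builds: int,
                max_days: Optional[int], max_builds: Optional[int]) -> bool:
    """Decide whether a build's logs may be reaped.

    age_days:     how old the build is (assumed input)
    newer_builds: how many builds of the same job are newer (assumed input)
    max_days / max_builds: the two retention options; None means unset.
    """
    conditions = []
    if max_days is not None:
        conditions.append(age_days > max_days)
    if max_builds is not None:
        conditions.append(newer_builds >= max_builds)
    # With no option set nothing is reaped; with both set, both must hold.
    return bool(conditions) and all(conditions)


# Old build, but still among the most recent builds to keep: not reaped.
print(should_reap(age_days=10, newer_builds=2, max_days=7, max_builds=5))  # False
# Old build that is also outside the retained build count: reaped.
print(should_reap(age_days=10, newer_builds=8, max_days=7, max_builds=5))  # True
```

The pre-fix behaviour described above would correspond to using any(conditions) instead of all(conditions).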
  • Apply a minimum rate limit for resource checking (#7218) @aoldershaw :link:

    • If CONCOURSE_MAX_CHECKS_PER_SECOND is unset, Concourse will try to distribute checks evenly over the course of the check interval to reduce the concurrent load on external systems.
    • If there are few resources in a Concourse deployment (~1-20), checks may have to wait a substantial amount of time to run in order to space the checks out evenly. However, there's no real benefit to doing this, since having just a few resources doesn't cause significant load in the first place.
    • Now, Concourse ensures that at least one check is allowed to run per second
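The effect of the minimum can be sketched in Python as follows. This assumes "distributing checks evenly" means dividing the number of checkable resources by the check interval; the function and parameter names are hypothetical, and this is not the actual rate limiter code.

```python
def checks_per_second(num_checkables: int, check_interval_seconds: float) -> float:
    """Rate at which checks may run when CONCOURSE_MAX_CHECKS_PER_SECOND is unset.

    Checks are spread evenly over the interval (assumed formula), but the
    rate never drops below one check per second.
    """
    if check_interval_seconds <= 0:
        raise ValueError("check interval must be positive")
    even_rate = num_checkables / check_interval_seconds
    return max(1.0, even_rate)


# 10 resources over a 60 second interval: the even rate would be about 0.17/s,
# so the minimum of one check per second applies.
print(checks_per_second(10, 60))   # 1.0
# 600 resources over 60 seconds: the even rate of 10/s is used unchanged.
print(checks_per_second(600, 60))  # 10.0
```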
  • atc/db: prevent creation of duplicate check builds (#7221) @taylorsilva :link:

    • Prevent duplicate checks from being created for a single resource
  • Fix browser back button after selecting a group (#7249) @aoldershaw :link:

    • Previously, if a pipeline group was selected in the UI, the back button would not work (you'd have to press it twice to go back)
  • set_pipeline unpauses previously archived pipelines (#7255) @aoldershaw :link:

    • When an archived pipeline is un-archived via the set_pipeline step, it will be unpaused
  • GC task caches belonging to archived pipelines (#7272) @aoldershaw :link:

  • containerd: Clean up networking files in /tmp (#7276) @taylorsilva :link:

    • Fixed a bug where the containerd runtime would create networking-related files under /tmp and never delete them. They are now created under the --work-dir set for the worker and are cleaned up when the container is deleted. You can delete any lingering network files under your workers' /tmp directory after upgrading.
  • Fix prometheus emitter not setting default attributes (#7294) @chenbh :link:
    • Additional metrics attributes configured by --metrics-attribute now propagate to the Prometheus emitter correctly.

  • run check builds GC in batch (#7323) @xtremerui :link:

🤷 Miscellaneous

  • Cleanup gomod and skip building fly by default in Dockerfile (#7058) @chenbh :link:

  • topgun: update regex for capturing instances (#7064) @taylorsilva :link:

  • containerd: update config to v2 format (#7078) @muntac :link:

  • [topgun/k8s] disable container limits failure test (#7087) @aoldershaw :link:

  • [topgun/k8s] disable container limits fail test (#7088) @aoldershaw :link:

  • use ip command instead of ifconfig in MTU test (#7089) @aoldershaw :link:

  • Merge 7.3.x branch into master (#7097) @aoldershaw :link:

  • fix go.sum (#7126) @aoldershaw :link:

  • [release/7.3.x] fix go.sum (#7127) @chenbh :link:

  • Move garden config tests to integration suite (#7135) @aoldershaw :link:

  • integration: add retries to vault initialization (#7142) @aoldershaw :link:

  • integration: fix package name (#7143) @aoldershaw :link:

  • Update README to point to GitHub discussions (#7159) @mthaddon :link:

  • fly: add --team flag to checklist (#7162) @tech-geek29 :link:

  • Reduce number of JOINs in builds query (#7184) @aoldershaw :link:

  • Treat resource types as checkable in rate limiter (#7211) @taylorsilva :link:

  • Remove prototype_id from builds table (#7220) @aoldershaw :link:

  • k8s/topgun: use regex to match error message (#7226) @taylorsilva :link:

  • Note that all workers need to be on the same network for p2p (#7227) @taylorsilva :link:

  • Revert "worker: make containerd the default runtime" (#7235) @taylorsilva :link:

  • Left-align job name in build header (#7236) @aoldershaw :link:

  • web: behavior: Fix snake_case method "button_tooltip" (#7238) @multimac :link:

  • Don't include check containers in worker cache (#7275) @aoldershaw :link:

  • containerd: split long property values into multiple chunks (#7288) @aoldershaw :link:

  • [containerd integration] Make host IP parsing more robust (#7293) @aoldershaw :link:

  • Run dependabot at midnight UTC (#7295) @chenbh :link:

  • Properly map version_from within across substep (#7310) @aoldershaw :link:

  • Update template (#7341) @taylorsilva :link:

  • Fix dogstatsd UDS file configuration (#7343) @aoldershaw :link:

📦 Bundled resource types

v6.7.7

2 years ago

✈️ Features

  • Optimize build log collector (#7334) @evanchaoli :link:
    • Optimized a SQL statement used to remove build logs. This optimization especially benefits large deployments that have a lot of pipelines.

🐞 Bug Fixes

  • Add autocomplete=off to all form tags (#6921) @taylorsilva :link:

    • Add autocomplete="off" to the form and username tags on the login page
  • Fix memory leak in notification bus (#7157) @taylorsilva :link:

🤷 Miscellaneous

  • Bump otel to 0.20.0 (#7305) @xtremerui :link:
    • Bump opentelemetry to v0.20.0 to address a CVE in one of its dependencies (apache/thrift)

📦 Bundled resource types

v7.3.2

2 years ago

🐞 Bug Fixes

  • [7.3.x] Fix memory leak in notification bus (#7164) @aoldershaw :link:

🤷 Miscellaneous

  • [release/7.3.x] fix go.sum (#7127) @chenbh :link:

📦 Bundled resource types

v7.3.1

2 years ago

🐞 Bug Fixes

  • Bump guardian to 1.19.28 :link:
    • Fixes a bug where guardian would fail to start up when the kernel version contained an unexpected suffix

🤷 Miscellaneous

  • [topgun/k8s] disable container limits fail test (#7088) @aoldershaw :link:

📦 Bundled resource types

v7.3.0

2 years ago

🚨 Breaking

  • Bump opentelemetry to 0.19.0 (#6787) @aoldershaw :link:
    • The service name for the Honeycomb tracing exporter is now configured via the more general --tracing-service-name (CONCOURSE_TRACING_SERVICE_NAME) rather than --tracing-honeycomb-service-name (CONCOURSE_TRACING_HONEYCOMB_SERVICE_NAME)

✈️ Features

  • Cache streamed volumes and use local cache when looking for volumes (#6660) @evanchaoli :link:
    Optimize resource cache streaming and get step.

    • Mark streamed resource cache volumes as resource caches, to avoid streaming them again in subsequent runs.
    • If the resource for a get step can already be found on some workers, the get step will do nothing. This reduces how often Concourse connects to external systems such as git, Docker Hub, and so on.
    • This feature is currently opt-in and can be enabled using the CONCOURSE_ENABLE_CACHE_STREAMED_VOLUMES flag.
  • Re-ordering instanced pipelines (#6830) @EstebanFS :link:

    • Instanced pipelines can be re-ordered within their group through the UI (using the drag and drop functionality) or using the fly command: fly -t dev oip -g groupName -p key1:var1 -p key2:var2
  • Enhance syslog-drainer to make it more useful (#6834) @SimonXming :link:

    • Add event_id into syslog-drainer entries, to get the correct order of "drained" build logs.
    • Add more supported event_type for syslog-drainer to include more info for "drained" build logs.
  • Enhance webhook triggered checks (#6854) @evanchaoli :link:

    • When multiple pipelines share a common resource and a webhook is called against that resource, checks are sent to all of those pipelines at the same time. Previously, each webhook call caused a check to run. Now only a single check runs, which is the expected behavior for a global resource.
  • Allow override of container limits in task config (#6867) @BooleanCat :link:

    • Pipeline authors can now set container_limits for reusable tasks in pipelines. Any limits set in the pipeline will override the limits set within the reusable task file.
  • Use cursor-based pagination for build events (#6873) @aoldershaw :link:

    • Optimizes fetching build logs from the DB for builds with massive logs
  • Use display_user_id field to render username in web interface (#6970) @logyball :link:

  • Set Content-Security-Policy and Cache-Control Headers (#6949) @taylorsilva :link:

    • A Content-Security-Policy header is now set with a default value that will block framing of the Concourse web UI. Blocking framing was already possible with the default value of the X-Frame-Options header.
      • The CSP header value is configurable with CONCOURSE_CONTENT_SECURITY_POLICY
    • A Cache-Control header is set on every page with a default value of no-store, private. The value of the header is overwritten for some paths (i.e. web assets)

🐞 Bug Fixes

  • Ensure stdin never errors when using containerd with TTY enabled (#6791) @chenbh :link:

    • Fixed a bug with the containerd runtime where a build would error out if it ran for a long time without any output
  • Add trigger for deleting pipeline (#6880) @xtremerui :link:

    • Fix a bug that might leave orphaned pipeline_build_events_* tables in the DB when deleting a team. Pipelines belonging to the deleted team are destroyed by DELETE CASCADE, but their associated events tables were not cleaned up properly.
  • Fix volume GC query to not include volumes with children (#6902) @xtremerui :link:

    • Fix a query that caused the error "volume cannot be destroyed as children are present" in web and the error "update or delete on table "volumes" violates foreign key constraint "volumes_parent_id_fkey"" in the DB.
  • Set autocomplete to off for login form (#6920) @taylorsilva :link:

    • add autocomplete="off" to the top-level form and username tags.
  • Scan unchecked resource-types (#6923) @EstebanFS :link:

    • Fixed an edge case where a put-only resource's parent-type would not be checked
  • Ignore "not found" error on process deletion for Containerd runtime (#6959) @aoldershaw :link:

  • worker: Set PATH based on UID instead of container's privileged state (#6982) @taylorsilva :link:

    • Containerd: fixed a bug where PATH did not contain directories for system tools (e.g. /sbin) when a user/process was root. Only affects unprivileged containers.
  • Fix Postgres deadlock when frequently setting pipelines (#7011) @aoldershaw :link:

  • containerd: allow use of non-existent uids (#7029) @muntac :link:

    • containerd supports running images with non-existent UIDs such as distroless images.

🤷 Miscellaneous

  • Add exception handling in bigint migration (#6848) @xtremerui :link:

  • add help text for password connector flag (#6876) @aoldershaw :link:

  • skip DNS proxy test with extra DNS server (#6878) @aoldershaw :link:

  • Optimise fake generation (#6885) @BooleanCat :link:

  • Refactor: Pick -> Approve (#6886) @taylorsilva :link:

  • Optimise fake generation (#6901) @BooleanCat :link:

  • re-enable k8s dns proxy test (#6906) @chenbh :link:

  • atc: fix asset leak in api tests (#6918) @vito :link:

  • web/wats: replace deprecated waitFor usage (#6922) @vito :link:

  • Correct a typo in project documentation (#6925) @plan-do-break-fix :link:

  • topgun/k8s: move dns proxy tests to integration (#6947) @chenbh :link:

  • Don't check resource-types from paused pipelines (#6961) @taylorsilva :link:

  • Add drills environment to major release template (#6967) @clarafu :link:

  • Using osFlag to fix windows worker error (#6971) @EstebanFS :link:

  • contributing: cover governance model, be brief (#6972) @vito :link:

  • [skip-migrations-check] bump secondary_order migration (#6979) @aoldershaw :link:

  • .github: set up security scanning action (#6992) @vito :link:

  • Fix bugs related to caching streamed volumes (#7001) @aoldershaw :link:

  • Fix caching when streamed volume was also streamed (#7024) @aoldershaw :link:

  • Upgraded xenial to bionic stemcell (#7037) @EstebanFS :link:

  • 7.3.x: make resource cache streaming opt-in (#7051) @taylorsilva :link:

  • 7.3.x: Fix capture of failing topgun tests logs (#7055) @taylorsilva :link:

  • 7.3.x: update regex for capturing instances (#7063) @taylorsilva :link:

📦 Bundled resource types

v7.2.0

3 years ago

🚨 Breaking

  • Wait for worker matching strategy when scheduling build steps (#6635) @multimac :link:
    • Previously, if no workers satisfied the container placement strategy for a step (with the exception of task steps when using the limit-active-tasks placement strategy), the step would simply error the build
    • Now, all steps will wait for a worker to become available
    • The metric concourse_tasks_waiting was removed and replaced with concourse_steps_waiting{type="task"}

✈️ Features

  • Add ability to navigate to resources page from build page (#6662) @chenbh :link:
    UI: clicking on the version text for a get/put step in the Build page will now navigate directly to the Resource page with the corresponding version expanded

  • Allow using LDAP as a password connector (#6671) @aoldershaw :link:

    • By setting --password-connector ($CONCOURSE_PASSWORD_CONNECTOR) to ldap, you can authenticate to Concourse with fly login -u ... -p ... using your LDAP credentials
      • Enabling this feature prohibits the use of local users
    • If you use an attribute other than username for authenticating with LDAP (e.g. email address), you can now configure --username-prompt ($CONCOURSE_USERNAME_PROMPT) to change the help text when logging in via the UI
  • Optimize check creation in DB (#6845) @aoldershaw :link:

  • Add DB index to optimize paginating job builds (#6871) @aoldershaw :link:

  • enhance put.inputs detect to ignore prefixed . and .. (#6705) @evanchaoli :link:

    • inputs: detect can now handle paths prefixed by "." and "..".

🐞 Bug Fixes

  • Fix empty worker tags (#6057) @aholyoake-bc :link:

  • runtime: check if swap limits is enabled (#6652) @taylorsilva :link:

    • The containerd runtime will conditionally set memory swap limits if it detects that memory swap limits are enabled
  • runtime: timeout set to 0 means there is no timeout (#6655) @EstebanFS :link:

    • Setting CONCOURSE_CONTAINERD_REQUEST_TIMEOUT to 0 now means there is no timeout
  • feat(atc): add check build metrics. (#6656) @evanchaoli :link:

    • Fixed metrics BuildsStarted, BuildsRunning, BuildStarted, BuildFinished to exclude check builds.
    • Added check build metrics: CheckBuildsStarted, CheckBuildsRunning, CheckBuildStarted, CheckBuildFinished
  • better handling for containerd error message (#6668) @muntac :link:

    • Fixed a bug with the containerd runtime where gracefully stopping a container might have failed with an unhandled error. Now it gracefully shuts down.
  • Prevent UI from stalling when you keep the resource page open for a while (#6703) @aoldershaw :link:

  • move migration table updating SQL into a migration transaction (#6727) @xtremerui :link:
    Fix a bug where a completed migration was not recorded in migrations_history table

  • Build image resource caches foreign key constraint to job ids should be on delete cascade (#6757) @clarafu :link:

    • This change fixes a bug that was introduced in v7.1.0 where deleting a pipeline could possibly result in a 500 error. This was caused by a foreign key constraint within the build_image_resource_caches table referencing a job in the jobs table.
  • Fix race condition in containerd runtime resulting in lost output for quickly printing-then-exiting processes (#6776) @vito :link:

  • update check metrics comments. (#6858) @evanchaoli :link:

    • Just update code comments, no release impact.

🤷 Miscellaneous

  • Bump dex to 0.4.0 and remove pkger (#6669) @aoldershaw :link:

  • github config updates (#6672) @vito :link:

  • Add "misc" label to dependabot (#6700) @chenbh :link:

  • fix go mod (#6716) @aoldershaw :link:

  • more lenient context deadline checking for containerd graceful process termination (#6717) @aoldershaw :link:
    Very rarely graceful shutdowns will return a context timeout error instead of gracefully shutting down.

  • fix font antialiasing issue due to less dependency bump (#6739) @aoldershaw :link:

  • Don't enforce step timeout during worker selection (#6760) @aoldershaw :link:

  • Avoid task specific error in load_var file artifact look-up (#6829) @kjgorman :link:

  • remove accidentally committed sql in migration (#6841) @xtremerui :link:

  • add release/undocumented labels to dependabot (#6842) @chenbh :link:

  • .github: add template for rfc tracking issue (#6844) @vito :link:

  • Remove unused ci folders (#6870) @taylorsilva :link:

  • [release/7.2.x] fast-forward (#6875) @chenbh :link:

  • [release/7.2.x] add help text for password connector flag (#6877) @aoldershaw :link:

📦 Bundled resource types

v6.7.6

3 years ago

🐞 Bug Fixes

  • backport #6197: Prevent retrying on worker error when build is aborted (#6598) @evanchaoli :link:

  • Bump lib/pq to 1.10.0 which fixes a regression in lib/pq where under certain circumstances the driver would not drop dead connections and never recover. (#6746) @taylorsilva :link:

  • Fix a panic in the New Relic metrics emitter (#6747) @taylorsilva :link:

🤷 Miscellaneous

  • Fix unit test after go 1.16 (#6553) @chenbh :link:

  • 6.7 remove packr backport (#6585) @xtremerui :link:

  • switch base_image build arg to lowercase (#6754) @aoldershaw :link:

📦 Bundled resource types

v7.1.0

3 years ago

✈️ Features

  • Allow favoriting instance groups (#6465) @aoldershaw :link:

  • Start non-privileged containers in their own cgroup namespace (#6492) @muntac :link:

  • Change SideBar "menu" icon (#6516) @chenbh :link:

    • Updated the visuals for the button to open and close the sidebar
  • Adjust spacing and padding for elements in pipeline card view in Dashboard (#6539) @xtremerui :link:

  • Show warning for pipelines configured with 'set_pipeline' step (#6621) @Infra-Red :link:

    • fly set-pipeline now prints a warning message when the pipeline has already been configured through a set_pipeline step.
  • Bump baggageclaim to v1.11.0 (#6643) @taylorsilva :link:

    • Privileged container initialization will be much faster for workers using OverlayFS as the baggageclaim driver and if their kernel supports OverlayFS's metacopy feature

🐞 Bug Fixes

  • Skip build log reaping process for paused jobs (#6573) @aoldershaw :link:

  • Fix reaped link in UI (#6579) @aoldershaw :link:

  • containerd: fix mount issues with certain images (#6592) @aoldershaw :link:

    • Fix an issue on the containerd runtime where processes fail to run with certain container images
  • Check parent resource types of resources that have set check_every: never (#6603) @taylorsilva :link:

    • Resources that had check_every: never and whose type was defined in resource_types in their pipeline would fail to check, because the parent resource type would never be checked
  • Bump elm-ansi to support 8-bit and 24-bit ANSI colors (#6605) @aoldershaw :link:

    • Fixes a bug where ANSI escape codes for 8-bit/24-bit colors were misinterpreted, resulting in build logs blinking and other peculiarities
  • Only interpolate static vars when it does not contain a source (#6619) @chenbh :link:

    • Fixed bug where static vars from fly set-pipeline -v ... -y ... were interpolated into local vars ((.:var))
  • containerd: infer MTU from host's network interface (#6624) @aoldershaw :link:

    • In prior versions of Concourse, the Containerd runtime always set the MTU of the container bridge network to the system default
    • Now, the Containerd runtime matches Guardian's behavior by:
      • Detecting the external IP of the host (can be set explicitly using CONCOURSE_CONTAINERD_EXTERNAL_IP)
      • Extracting the MTU from the network interface corresponding with that IP (can be set explicitly using CONCOURSE_CONTAINERD_MTU)

🤷 Miscellaneous

  • Cache streamed volumes and try to get by looking for local cache (#6495) @evanchaoli :link:

  • Fix unit test after go 1.16 (#6545) @chenbh :link:

  • Switch migrations + web assets to Go 1.16 embedding (#6550) @vito :link:

  • Cleanup from staticcheck (#6561) @taylorsilva :link:

  • Update fake files generated by latest version of counterfeiter. (#6564) @evanchaoli :link:

  • Default to containerd in docker-compose.yml (#6580) @aoldershaw :link:

  • Bump elliptic from 6.5.3 to 6.5.4 (#6653) @dependabot :link:

  • Revert "feat(atc): cache streamed volumes and try to get by looking for local cache" (#6659) @clarafu :link:

📦 Bundled resource types

v7.0.0

3 years ago

🚨 Breaking

  • Run checks as builds (#6022) @vito :link:

    • Breaking change: unique_version_history can no longer be configured on resource types. No one seemed to be using it, and it made internal architecture unnecessarily complicated. The need for it should go away entirely as we make progress on the v10 roadmap.
    • Resource check operations, which collect and save versions for pipeline resources, are now run as builds.
      • This is largely an internal architecture refactor, but it also improves UX - check output can now be viewed on the resource page!
    • fly check-resource and fly check-resource-type now stream the checking output to the user, just like fly watch and fly trigger-job.
    • This change includes a migration to convert the id column of the builds table and all tables referencing build_id to a bigint. This is unfortunately a slow migration, so please anticipate downtime proportional to the number of builds in your database.
      • If the migration fails with deadlock detected, shut down the other web nodes first.
      • Our large-ish scale test environment took about an hour.
  • Remove aggregate step (#6349) @taylorsilva :link:

    • Removing the aggregate step as planned. It is succeeded by the in_parallel step.

✈️ Features

  • The formerly-experimental containerd runtime is now GA and is considered ready for production use :link:

    • We will be changing the default container runtime from Guardian to containerd in coming releases, but we encourage using the containerd runtime ASAP
    • To enable the containerd runtime, set --runtime ($CONCOURSE_RUNTIME) to containerd on the concourse worker command
    • You will also need to convert any --garden-* ($CONCOURSE_GARDEN_*) flags to their containerd counterparts:
      • --garden-request-timeout ($CONCOURSE_GARDEN_REQUEST_TIMEOUT) -> --containerd-request-timeout ($CONCOURSE_CONTAINERD_REQUEST_TIMEOUT)
      • --garden-dns-proxy-enable ($CONCOURSE_GARDEN_DNS_PROXY_ENABLE) -> --containerd-dns-proxy-enable ($CONCOURSE_CONTAINERD_DNS_PROXY_ENABLE)
      • --garden-network-pool ($CONCOURSE_GARDEN_NETWORK_POOL) -> --containerd-network-pool ($CONCOURSE_CONTAINERD_NETWORK_POOL)
      • --garden-max-containers ($CONCOURSE_GARDEN_MAX_CONTAINERS) -> --containerd-max-containers ($CONCOURSE_CONTAINERD_MAX_CONTAINERS)
      • $CONCOURSE_GARDEN_DENY_NETWORKS -> --containerd-restricted-network ($CONCOURSE_CONTAINERD_RESTRICTED_NETWORK)
      • $CONCOURSE_GARDEN_DNS_SERVER -> --containerd-dns-server ($CONCOURSE_CONTAINERD_DNS_SERVER)
    • If you rely on any Garden config that is not yet supported on our containerd runtime, please open an issue
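As an illustration, the flag conversions listed above can be written as a lookup table over the environment variable names. The convert_env helper below is hypothetical, and copying values across unchanged is an assumption, since the accepted value formats may differ between the two runtimes.

```python
from typing import Dict, List, Tuple

# Environment variable conversions taken from the list above.
GARDEN_TO_CONTAINERD = {
    "CONCOURSE_GARDEN_REQUEST_TIMEOUT": "CONCOURSE_CONTAINERD_REQUEST_TIMEOUT",
    "CONCOURSE_GARDEN_DNS_PROXY_ENABLE": "CONCOURSE_CONTAINERD_DNS_PROXY_ENABLE",
    "CONCOURSE_GARDEN_NETWORK_POOL": "CONCOURSE_CONTAINERD_NETWORK_POOL",
    "CONCOURSE_GARDEN_MAX_CONTAINERS": "CONCOURSE_CONTAINERD_MAX_CONTAINERS",
    "CONCOURSE_GARDEN_DENY_NETWORKS": "CONCOURSE_CONTAINERD_RESTRICTED_NETWORK",
    "CONCOURSE_GARDEN_DNS_SERVER": "CONCOURSE_CONTAINERD_DNS_SERVER",
}


def convert_env(env: Dict[str, str]) -> Tuple[Dict[str, str], List[str]]:
    """Return (converted environment, Garden variables with no listed counterpart)."""
    converted = {"CONCOURSE_RUNTIME": "containerd"}  # enable the containerd runtime
    unsupported = []
    for name, value in env.items():
        if name in GARDEN_TO_CONTAINERD:
            # Assumption: the value is carried over as-is.
            converted[GARDEN_TO_CONTAINERD[name]] = value
        elif name.startswith("CONCOURSE_GARDEN_"):
            unsupported.append(name)  # needs manual review
        else:
            converted[name] = value
    return converted, unsupported


# CONCOURSE_GARDEN_EXAMPLE_OPTION is a made-up name standing in for any
# Garden setting that is not in the list above.
new_env, needs_review = convert_env({
    "CONCOURSE_GARDEN_DNS_SERVER": "8.8.8.8",
    "CONCOURSE_GARDEN_EXAMPLE_OPTION": "x",
})
print(new_env)
print(needs_review)
```

Anything reported in the second return value has no counterpart in the list above and would need to be checked by hand.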
  • Perform image fetching using check/get sub-steps (#6153) @vito :link:

    • Image fetching for resources and resource types is now handled explicitly in the build plan using check and get steps, and can be inspected in the UI

  • Show resource check build output in web UI (#6137) @vito :link:

  • Enforce SetPipeline policy check in set_pipeline step (#5932) @cludden :link:

    • When OPA integration is enabled, the set_pipeline step now respects the same policy check as fly set-pipeline
  • Speed up database queries by adding a job_id column to build image resource caches table and adding an index for ordering builds of a job (#5993) @clarafu :link:

  • Allow globs in groups (#6060) @andy-paine :link:

    • groups in a pipeline can now match jobs based on globs e.g.:
      groups:
      - name: deploy
        jobs:
        - deploy-*
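
To get a feel for which jobs a pattern such as deploy-* would select, here is a rough Python sketch using the standard library's fnmatch module. Concourse's own glob syntax may differ from Python's in its details, and the job names and the jobs_in_group helper are made up for the example.

```python
from fnmatch import fnmatchcase


def jobs_in_group(patterns, job_names):
    """Return the job names matched by at least one glob pattern, in order."""
    return [job for job in job_names
            if any(fnmatchcase(job, pattern) for pattern in patterns)]


# Hypothetical job names in a pipeline.
jobs = ["unit", "deploy-staging", "deploy-prod", "redeploy-all"]

# The pattern must match the whole name, so "redeploy-all" is not selected.
print(jobs_in_group(["deploy-*"], jobs))  # ['deploy-staging', 'deploy-prod']
```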
      
  • Add flag to concourse worker to overwrite init binary path for the containerd runtime (#6086) @chenbh :link:

    • The init binary can be configured using the --containerd-init-bin flag ($CONCOURSE_CONTAINERD_INIT_BIN)
  • Make CNI plugins directory configurable for the containerd runtime (#6120) @muntac :link:

    • CNI plugins directory can be configured using the --containerd-cni-plugins-dir flag ($CONCOURSE_CONTAINERD_CNI_PLUGINS_DIR)
  • Fallback fly intercept to sh when bash is missing (#6098) @shua :link:

    • If no command is specified, fly intercept will first try to use bash for an interactive shell, but if the container returns an error indicating bash is not available, fly will fall back to the more common (but more limited) sh
    • If this fallback logic is not desired, the user can explicitly specify bash as the path argument to the fly intercept command
  • Add Honeycomb.io as optional tracing backend (#6103) @spire-allyjweir :link:

  • [experimental] Group instanced pipelines on UI (#6105) @aoldershaw :link:

    • Instanced pipelines (RFC) provide a mechanism for constructing multiple instances of a pipeline template that differ by some parameters
      • e.g. to support multiple release lines, you may have a collection of instanced pipelines called release that differ by the version line (1.0.x, 1.1.x, 2.0.x, etc.)
    • All instanced pipelines with the same name (but different parameters) will be collected in the UI into a grouping of related pipelines, removing clutter from the dashboard when there are many instances of a pipeline template
    • Instanced pipelines are currently experimental until we work out the UX, but if you'd like to play around with them, you can set the flag --enable-pipeline-instances ($CONCOURSE_ENABLE_PIPELINE_INSTANCES)
  • Update go module dependencies [go.opentelemetry.io/otel] (#6106) @christophermancini :link:

  • Implement support for Vault KV v2 backends (#6115) @daviddob :link:

  • add support for exporting traces via OTLP (#6122) @christophermancini :link:

    • Added support for OTLP as a target for traces to be exported to
  • Add index to speed up build deletion, fix up a few issues with checks as builds (#6125) @vito :link:

  • Add --team flag to fly order-pipelines command (#6132) @witjem :link:

  • Add --team option to fly get-pipeline command (#6144) @techgaun :link:

  • Add --team option to fly expose-pipeline command (#6169) @techgaun :link:

  • Ensure pipelines contain at least one job (#6159) @taylorsilva :link:

    • Pipelines are now validated to ensure that they contain at least one job - pipeline configs with no jobs will be rejected
  • set_pipeline step prints 'no changes to apply' (#6164) @mdb :link:

    • set_pipeline now prints "no changes to apply" and thereby behaves similarly to fly set-pipeline when a pipeline config contains no changes.
  • Update colours and contrast (#6168) @jomsie :link:

  • Experimental support for P2P Volume Streaming (#6186) @evanchaoli :link:

    • Support P2P volume streaming directly between two workers instead of through the ATC.
      • This is an opt-in feature enabled with --enable-p2p-volume-streaming or the env var $CONCOURSE_ENABLE_P2P_VOLUME_STREAMING on the web nodes. When this feature is enabled, --baggageclaim-bind-ip on workers should be set to 0.0.0.0 so that baggageclaim can be accessed from other workers.
      • This should only be used for clusters where all workers can reach each other on the same local network.
      • Adds --baggageclaim-p2p-interface-name-pattern and --baggageclaim-p2p-interface-family to the worker command.
  • Add a new metric "volumes streamed" (#6187) @evanchaoli :link:

  • Log the worker name when creating a container fails (#6188) @evanchaoli :link:

  • Support chained container placement strategies. (#6208) @evanchaoli :link:

    • Enhanced container placement strategy to support chained strategies, for example CONCOURSE_CONTAINER_PLACEMENT_STRATEGY=volume-locality,fewest-build-containers
  • Add new container placement strategies: limit-max-containers and limit-max-volumes (#6339) @evanchaoli :link:

    • These strategies prevent scheduling on workers that already have too many containers or volumes on them (respectively), according to the limits set by --max-active-containers-per-worker and --max-active-volumes-per-worker
    • A possible placement strategy chain to better balance workloads across workers is [limit-max-containers, limit-max-volumes, volume-locality, fewest-build-containers]
      • This strategy chain first filters out workers that already have too many containers/volumes, then chooses all the workers with the most inputs already present locally, breaking ties by preferring the worker with fewer containers
    • Note: workers are not guaranteed to never exceed the maximum specified limits
  • go-concourse surfaces error messages on saving pipelines (#6222) @andy-paine :link:

    • Where fly set-pipeline previously printed only forbidden when the underlying API call returned a 403 status, it now prints the body of the response. In particular, errors originating from OPA policy check rejections are now shown.
  • Don't enforce timeouts during image fetching (#6237) @vito :link:

  • Ignore paused jobs when displaying pipeline status in the UI (#6270) @chenbh :link:

    • The UI will no longer consider paused jobs when figuring out the overall status of a pipeline
  • Skip checking put-only resources (#6281) @evanchaoli :link:

    • An optimization which should lower the resource checking load on some instances: instead of checking all resources, only resources which are actually used as inputs will be checked. This feature was released in 6.0.0 and reverted in 6.6.0 because of its side effects. Those side effects have since been resolved, so the optimization is back.
    • The --enable-skip-checking-not-in-use-resources flag has been removed as it is no longer needed.
  • fly set-pipeline prints pipeline name and instance vars (#6300) @aoldershaw :link:

  • Remove legacy logic for dealing with resource versions that have a check order of zero (#6323) @clarafu :link:

    • Includes a migration that will delete any versions with a check order of 0. This should not affect anything because versions with a check order of 0 are invalid versions.
    • Should speed up some queries that had legacy logic with filtering on versions with a check order of 0.
  • fly: Add fish auto complete (#6329) @Sasasu :link:

    • fly: add autocomplete for fish.
  • start containerd with low oom_score (#6330) @muntac :link:

    • It is recommended that containerd be started with an oom_score of -999. We want it to be at the level of other system daemons. This is so that containerd never runs into an out of memory state before the containers it's managing are cleaned up. At the same time it should not be unkillable.
  • Give worker registration its own database connection pool (#6332) @taylorsilva :link:

    • Give the worker registration endpoint its own database connection pool to avoid the situation where the API connection pool is maxed out and workers fail to register and stall
  • Allow underscore in identifiers (#6338) @aoldershaw :link:

  • Support for mTLS (#6355) @nickhyoti :link:

    • Added support for mTLS between Concourse and a reverse proxy that may be in front of Concourse
  • Allow configuring login and query timeouts for Vault (#6362) @evanchaoli :link:

    • These timeouts can be configured using CONCOURSE_VAULT_LOGIN_TIMEOUT and CONCOURSE_VAULT_QUERY_TIMEOUT respectively
    • The new default login timeout is 60s
  • Expose username of who manually triggered build to build metadata. (#6369) @evanchaoli :link:

    • fly builds has a new column, created by, that shows the user ID when a build is triggered manually.

    • A new build metadata field, BUILD_CREATED_BY, can be exposed to resources. It is not exposed by default; to turn it on, add expose_build_created_by when defining a resource:

      resources:
      - name: some-resource
        type: some-type
        expose_build_created_by: true
        source:
          ...
      
    • As different authentication connectors populate different claims, a new concourse web CLI option, --concourse-display-user-id-per-connector, allows cluster administrators to configure which claims field should be considered the unique user ID

      • Values of this option should be in format <connector>:<fieldname>
        • connector is one of: ldap, github, cf, bitbucket-cloud, gitlab, microsoft, oauth, oidc or saml
        • fieldname is one of:
          • user_id mapping to claims' user id field
          • name mapping to claims' username field
          • username mapping to claims' preferred username field
          • email mapping to claims' email field
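
As a rough illustration of the <connector>:<fieldname> format, the following Go sketch splits and validates a single value against the connector and field names listed above. The function name parseDisplayUserID and the error wording are hypothetical; how Concourse itself parses the option is not shown in these notes.

```go
package main

import (
	"fmt"
	"strings"
)

// Connector and field names as listed in the release note above.
var connectors = map[string]bool{
	"ldap": true, "github": true, "cf": true, "bitbucket-cloud": true,
	"gitlab": true, "microsoft": true, "oauth": true, "oidc": true, "saml": true,
}

var fields = map[string]bool{
	"user_id": true, "name": true, "username": true, "email": true,
}

// parseDisplayUserID is a hypothetical helper that splits one
// <connector>:<fieldname> value and rejects unknown names.
func parseDisplayUserID(value string) (string, string, error) {
	connector, field, ok := strings.Cut(value, ":")
	if !ok {
		return "", "", fmt.Errorf("%q is not in <connector>:<fieldname> format", value)
	}
	if !connectors[connector] {
		return "", "", fmt.Errorf("unknown connector %q", connector)
	}
	if !fields[field] {
		return "", "", fmt.Errorf("unknown field name %q", field)
	}
	return connector, field, nil
}

func main() {
	connector, field, err := parseDisplayUserID("github:username")
	fmt.Println(connector, field, err) // github username <nil>

	_, _, err = parseDisplayUserID("github")
	fmt.Println(err)
}
```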
  • Allow disabling resource checking for individual resources (#6386) @taylorsilva :link:

    • Automatic resource checking for individual resources can be disabled by setting check_every: never in a resource's definition
  • db: lidar checks put-only resources with failed checks (#6412) @taylorsilva :link:

    • Lidar now checks any put-only resources that ran a check which failed.
  • Add a flag to migrate to the latest db version (#6426) @taylorsilva :link:

    • add a --migrate-to-latest-version flag to the migrate command. This flag has concourse perform database migrations to the latest database version.
    • the concourse web command will still automatically migrate the database
  • Enhance search bar filtering and allow filtering by instance group (#6433) @aoldershaw :link:

    • Allow filtering by exact match on the dashboard by quoting search terms
    • Allow applying multiple search filters simultaneously (e.g. team:"main" status:paused)
    • Make search suggestions more intelligent
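
To make the filter syntax concrete, here is a small Go sketch that splits a query such as team:"main" status:paused into separate filters and marks quoted values as exact matches. It is only an approximation: it assumes quoted values contain no spaces and that unquoted terms match by substring, neither of which is stated in these notes. Filter, parseQuery and matches are hypothetical names.

```go
package main

import (
	"fmt"
	"strings"
)

// Filter is one search term. Key is empty for a bare term with no "key:" prefix.
type Filter struct {
	Key   string
	Value string
	Exact bool // true when the value was quoted
}

// parseQuery splits a query on whitespace and parses each term.
func parseQuery(query string) []Filter {
	var filters []Filter
	for _, term := range strings.Fields(query) {
		var f Filter
		if key, value, ok := strings.Cut(term, ":"); ok {
			f.Key, f.Value = key, value
		} else {
			f.Value = term
		}
		if len(f.Value) >= 2 && strings.HasPrefix(f.Value, `"`) && strings.HasSuffix(f.Value, `"`) {
			f.Value = f.Value[1 : len(f.Value)-1]
			f.Exact = true
		}
		filters = append(filters, f)
	}
	return filters
}

// matches reports whether a filter accepts the given value.
func matches(f Filter, actual string) bool {
	if f.Exact {
		return actual == f.Value
	}
	return strings.Contains(actual, f.Value) // assumption: substring match
}

func main() {
	filters := parseQuery(`team:"main" status:paused`)
	fmt.Printf("%+v\n", filters)
	fmt.Println(matches(filters[0], "main"), matches(filters[0], "main-team"))
}
```

In this sketch the quoted team filter accepts main but rejects main-team, while an unquoted team:main would accept both.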
  • Add more tooltips for action buttons (#6453) @aoldershaw :link:

    • Many buttons in the UI now have a tooltip on hover to indicate what they do
  • Bump BaggageClaim to v1.10.0 (#6500) @vito :link:

    • Windows workers will now shell out to the much faster robocopy executable for copying local files. This should dramatically improve performance for Windows tasks that use caches: to cache many small files.
  • metrics: make tasks_wait_duration histogram record up to 1h (#6506) @marco-m-pix4d :link:

  • Removes unnecessary indexes from build events tables (#6522) @clarafu :link:

  • Allow @ in vars path (#6129) @xtremerui :link:

🐞 Bug Fixes

  • fly pin-resource requires a version if the resource is unpinned (#6095) @jamieklassen :link:

    • Previously, you could run the command on an unpinned resource without passing a version -- it would run and succeed, but do nothing. Now the command will fail and print an error message.
  • Fix pipeline cards being rendered off-screen when sidebar was open (#6102) @aoldershaw :link:

    • Fixes occasional bug where pipelines would be rendered off-screen after a refresh on the dashboard
  • Fix quoting for var subkeys (#6108) @vixus0 :link:

    • Fix interpolation of quoted variable fields containing special characters.
  • Prevent set_pipeline runtime error (#6116) @mdb :link:

    • set_pipeline of a YML pipeline configuration file with no jobs: or resources: no longer causes a runtime error: invalid memory address or nil pointer dereference.
  • Use default uid:gid if passwd file does not exist and username is "root" (#6142) @chenbh :link:

    • The containerd runtime will now default to uid:gid 0:0 if username is "root" but /etc/passwd file does not exist
    • This matches the behaviour of the default guardian backend
  • Preserve whitespace within build output (#6157) @clarafu :link:

    • In v6.6.0, whitespace was collapsed in order to fix a bug with horizontal scrolling in the build output. This change will preserve all whitespace while also keeping the horizontal scrolling fix.
  • Prevent retrying on worker error when build is aborted (#6197) @evanchaoli :link:

    • Fixed an endless build retry bug
  • Limit configured by limit-active-tasks is not respected and more tasks can land on a worker (#6216) @aliculPix4D :link:

  • Remove any existing guardian assets (#6257) @taylorsilva :link:

    • The worker will now clear out any existing Guardian assets on start-up (/var/gdn/assets)
    • This fixes in-place upgrade scenarios where guardian was using old versions of runc
  • atc: abort a rerun build if input version gone (#6265) @xtremerui :link:

    • A rerun build will be aborted automatically if the required version of any input is not available.
  • set-pipeline prompted unpause-pipeline command should have --team option. (#6336) @evanchaoli :link:

    • Fixed a bug in fly set-pipeline where the --team option was missing from the prompted unpause-pipeline command.
  • Fix mount issues on containerd (#6348) @muntac :link:

    • Set the appropriate permissions for mounts in privileged containers.
    • Use the Linux default size for /dev/shm (shared memory) mount.
  • Bump baggageclaim to 1.9.1 to fix deeply-nested volumes with overlay driver (#6393) @vito :link:

    • This was partially fixed by #5961, but that original fix did not solve the problem in all cases
  • Ensure task, set_pipeline, load_var steps have names (#6410) @taylorsilva :link:

    • Return an error when no identifier is provided for task, set_pipeline, and load_var steps
  • add lock for concourse migrate to latest version cmd (#6510) @xtremerui :link:

🤷 Miscellaneous

  • Apply bigint migrations to build_events partitions and read from old/new columns instead (#6305) @vito :link:

    • Rather than migrating the build_events table's build_id column to bigint, which we found to be too slow in large deployments, this PR adds a new bigint column to the build_events table that is populated at runtime
    • When querying for build events, we consider both the old and new column, since builds run prior to the upgrade to 7.0.0 will only have the old column set
  • Migrate build_id references to bigint (#6203) @vito :link:

    • This migration can be quite slow if you have a ton of builds, meaning web nodes may take a while to start upon upgrading.
    • This does not migrate the build_events table - that is handled in #6305
  • Update k8s-topgun for Helm v3 (#6034) @xtreme-vikram-yadav :link:

  • Refactor RunState to have ownership over build vars (#6082) @aoldershaw :link:

  • Switch back to upstream go-flags (#6096) @jamieklassen :link:

  • Fix fly integration tests on windows (#6099) @aoldershaw :link:

  • Add an example for the release note section in the pr template (#6109) @clarafu :link:

  • Add DeepSource config (#6110) @vito :link:

  • Emit start/finish metrics from check step (#6119) @vito :link:

  • Cleanup k8s topgun releases if failure occurs outside of It block (#6131) @chenbh :link:

  • Convert a few failing TopGun tests to Testflight (#6136) @vito :link:

  • Use custom types for CPU/Memory limits (#6143) @aoldershaw :link:

  • Minor refactoring around Variables (#6152) @aoldershaw :link:

  • Introduce dbtest package for higher-level test setup (#6156) @vito :link:

  • k8s topgun: fix panic (#6167) @chenbh :link:

  • Rename engine/builder types, merge into engine/ package (#6174) @vito :link:

  • Use dot notation in instanced pipelines API (#6177) @zoetian :link:

  • Respect tags when fetching images, send public plans in fetching events (#6184) @vito :link:

  • Fix topgun/k8s tests (#6185) @xtreme-sameer-vohra :link:

  • Fix check container placement, and only use check sessions for pipeline resource checks (#6189) @vito :link:

  • Update last check end time on check failure/error (#6192) @vito :link:

  • Bump dex module to v0.2.0 (#6195) @xtremerui :link:

    • Concourse uses a fork of Dex as its auth module. This Dex fork had not been synced with upstream since v2.16.0. It now tracks the latest release, v2.25.0, which brings many improvements and bug fixes. Some new connectors are also available (Google, OpenShift, Atlassian Crowd, etc.); we will evaluate and support them in future releases. A tracking issue has been created: https://github.com/concourse/concourse/issues/6194.
  • Determine across step's max_in_flight at runtime (#6200) @aoldershaw :link:

  • Fix pipeline UI non-trigger input edges showing up as trigger (#6202) @vito :link:

  • topgun: use busybox for http proxy... for now (#6204) @vito :link:

  • Fix race condition with check build creation + starting (#6205) @vito :link:

  • Clear build events from previous resource/resource type check (#6211) @aoldershaw :link:

  • fast forward release branch (#6212) @chenbh :link:

  • Use proper types for jobs and resources on pipeline page (#6214) @aoldershaw :link:

  • Acquire lock on builds table in build_id migrations (#6219) @aoldershaw :link:

  • Revert "migration: acquire lock on builds table in build_id migrations" (#6220) @aoldershaw :link:

  • Use proper types for jobs and resources on pipeline page (#6221) @aoldershaw :link:

  • Use base resource type defaults in get/put steps (#6224) @aoldershaw :link:

    • This functionality already exists in v6.7.0+
  • [topgun/k8s] Stop leaking namespaces in test env (#6227) @taylorsilva :link:

  • [release/6.7.x] use k8s/client-go v11.0 for topgun (#6232) @chenbh :link:

  • Check step only acquires lock for periodic pipeline resource checks (#6235) @vito :link:

  • Fix up test data race for build tracker (#6254) @vito :link:

  • Add docker-compose overrides for SAML, LDAP, OAuth, and OIDC (#6262) @chenbh :link:

  • fly: give more leeway for slow test (#6264) @aoldershaw :link:

  • [topgun] skip failing topgun/k8s test (#6287) @taylorsilva :link:

  • Optimize fly tests (#6299) @muntac :link:

  • fly intercept falls back to sh when bash is missing (containerd runtime) (#6304) @aoldershaw :link:

  • add yarn install step to web ui instructions (#6313) @muntac :link:

  • Avoid recompiling fly in tests (#6317) @vito :link:

  • Fix 'fly sync' Windows tests (#6320) @vito :link:

  • fly rename-pipeline can rename an entire instance group (#6321) @aoldershaw :link:

  • Merge v6.7.2 into master [skip-migrations-check] (#6328) @vito :link:

  • limit-active-tasks strategy waits for a worker to be available (#6359) @taylorsilva :link:

    • This is the current behavior of limit-active-tasks, but it was broken during development. It does not impact a released version of concourse
  • Bump dex to patch an XML vulnerability (#6370) @taylorsilva :link:

    • Bump Dex to 2.27.0 which fixes a vulnerability in the go XML library
    • Concourse v6.7.3+ also has this patch applied
  • Refactor worker selection to facilitate adding multiple worker runtimes (#6387) @aoldershaw :link:

  • Deterministic ordering of ?vars params for instanced pipelines API (#6392) @aoldershaw :link:

  • build(deps): bump ini from 1.3.5 to 1.3.8 in /web/wats (#6394) @dependabot :link:

  • build(deps): bump ini from 1.3.5 to 1.3.8 (#6395) @dependabot :link:

  • Fix build finished metric for prometheus (#6401) @taylorsilva :link:

    • This doesn't affect any released version of Concourse, but the bug was introduced in a PR during development
  • Show instance vars in set_pipeline step header (#6409) @aoldershaw :link:

  • Fix archived pipeline check on the resource/job/build page for an instanced pipeline (#6414) @aoldershaw :link:

  • docker: add build arg for base image (#6415) @aoldershaw :link:

  • Fix groups with OIDC connector (#6436) @konstl000 :link:

    • Adds an --oidc-disable-groups flag that disables fetching groups claims from an upstream OIDC provider. By default, the groups claim is fetched (as with previous versions of Concourse)
    • This was fixed in 6.7.4 as part of #6448.
  • make it clear on oidc scopes configuration (#6443) @xtremerui :link:

  • Remove --force in k8s-topgun test (#6449) @xtremerui :link:

  • Optimize fly windows tests (#6450) @aoldershaw :link:

  • introduce new integration test suite (#6479) @vito :link:

  • Skip Vault tests in Topgun (#6481) @aoldershaw :link:

  • move pkged.go to cmd/concourse/ (#6482) @vito :link:

  • runtime: add test coverage for container.Stop() (#6483) @muntac :link:

  • hide "group:" dropdown suggestion in search (#6487) @aoldershaw :link:

  • Optimize postgres runner for db tests (#6489) @aoldershaw :link:

  • Ignore archived pipelines for instance group count in top bar (#6491) @aoldershaw :link:

  • rename OIDC skip email verified flag (#6497) @aoldershaw :link:

  • Back-port integration test suite (#6499) @aoldershaw :link:

  • worker/runtime: remove no-op device rule (#6507) @muntac :link:

  • Optimize check deletion (#6511) @aoldershaw :link:

  • Add index and mini refactor to the check lifecycle query (#6517) @clarafu :link:

📦 Bundled resource types