agola

Commit Graph

Author	SHA1	Message	Date
Simone Gotti	d2b09d854f	: use new errors handling library Implement a new error handling library based on pkg/errors. It provides stack saving on wrapping and exports some function to add stack saving also to external errors. It also implements custom zerolog error formatting without adding too much verbosity by just printing the chain error file:line without a full stack trace of every error. Add a --detailed-errors options to print error with they full chain * Wrap all error returns. Use errors.WithStack to wrap without adding a new messsage and error.Wrap[f] to add a message. * Add golangci-lint wrapcheck to check that external packages errors are wrapped. This won't check that internal packages error are wrapped. But we want also to ensure this case so we'll have to find something else to check also these.	2022-02-28 12:49:13 +01:00
Simone Gotti	d1b4ab4296	*: use zerolog for logging Replace zap with zerolog. zerolog has a cleaner interface and can be easily configured with custom error chain printing using a new error handling library that will be implemented in another PR.	2022-02-28 10:40:55 +01:00
Simone Gotti	87f182a0c9	*: use errors.Is/errors.As to handle wrapped error checking Enable golangci-lint errorlint linter to check proper use of errors.Is and error.As instead of direct comparison or error type casting.	2022-02-24 17:07:29 +01:00
Simone Gotti	0e2b01a586	runservice: correctly handle skipped tasks in fetcher skip fetching of tasks with status skipped, not only tasks marked as skip. This avoid many wrong an noisy logs of type "executor task with id taskid doesn't exist. This shouldn't happen. Skipping fetching"	2020-03-02 10:40:59 +01:00
Simone Gotti	eb180da914	Merge pull request #225 from sgotti/runservice_fix_handling_of_wrong_executortask_status runservice: fix handling of wrong executortask status	2020-03-02 10:26:32 +01:00
Simone Gotti	19611c18e7	runservice: fix handling of wrong executortask status updateRunTaskStatus should also accept transitions from not started to a finished state like "success", "failed", "stopped" since we could miss some status updates from the executor for many reasons.	2020-02-28 13:02:35 +01:00
Simone Gotti	3ac018e6e5	runservice: use all scheduled tasks in scheduleRun rename activeExecutorTasks to scheduledExecutorTasks and don't filter out finished tasks. In some logic we need all the scheduled tasks and not only the not finished ones.	2020-02-28 09:56:12 +01:00
Simone Gotti	145c87b4c0	runservice: minimize scheduling of tasks that will be queued by the executor Since the executor only periodically updates its state we could end up scheduling much more tasks than the executor ActiveTasksLimit. This will happen in the case of many parallel tasks that can all start at the same time. To avoid this also considere the executor tasks saved in etcd that represent the real view of scheduled tasks.	2020-02-27 11:03:03 +01:00
Simone Gotti	5dd9e587fe	runservice: mark not running tasks as skipped when run marked to stop Currently when a run is marked to stop we are going to stop currently running tasks and then their childs will be marked as skipped. But tasks not depending on a stopped task (root task or childs with a finished parent) that are just waiting for an executor slot, will be scheduled when there will be a free slot also if the run is marked to stop (and then the scheduler will stop them after some seconds). This patch will mark all not started tasks as skipped when the run is marked to stop.	2020-02-26 16:45:09 +01:00
Simone Gotti	2de91549a3	tests: improve services logging During tests provide a zaptest Logger so all services output will be redirected to golang testing logger. When multiple services of the same type are provided add a unique name field to distinguish them.	2020-01-15 12:30:34 +01:00
Simone Gotti	07cde065c8	runservice: use etcd mutex TryLock on fetching When fetching avoid concurrent fetches from multiple runservices using an etcd mutex TryLock.	2019-11-13 11:53:54 +01:00
Simone Gotti	5ab9f7c970	*: use etcd mutex TryLock etcd PR 11104 (https://github.com/etcd-io/etcd/pull/11104) implemented mutex TryLock. Since it's only available in etcd master just copy relevant code and add a TODO to remove it when updating the etcd client to a version implementing TryLock. Use TryLock everywhere where it'll be useful.	2019-11-12 22:27:17 +01:00
Simone Gotti	72f279c4c3	: improve error handling objectstorage: remove `types` package and move `ErrNotExist` in base package * objectstorage: Implement .Is and add helper `IsErrNotExist` for `ErrNotExist` * util: Rename `ErrNotFound` to `ErrNotExist` * util: Add `IsErr` helpers and use them in place of `errors.Is()` datamanager: add `ErrNoDataStatus` to report when there's not data status in ost * runservice/common: remove `ErrNotExist` and use errors in util package	2019-11-11 12:17:35 +01:00
Simone Gotti	5af07d0852	objectstorage: use a single package remove all the subpackages and just use a single package	2019-11-08 16:31:48 +01:00
Simone Gotti	e18794764e	go.mod: update dependencies Update all the updatable dependencies	2019-10-29 09:31:38 +01:00
Simone Gotti	39829f1ec4	runservice: save step exitstatus in run. For every step save also the command exit status.	2019-09-17 14:35:37 +02:00
Simone Gotti	12b02143b2	runservice: don't save executor task data in etcd Reorganize ExecutorTask to better distinguish between the task Spec and the Status. Split the task Spec in a sub part called ExecutorTaskSpecData that contains tasks data that don't have to be saved in etcd because it contains data that can be very big and can be generated starting from the run and the runconfig.	2019-09-17 12:03:43 +02:00
Simone Gotti	7d375e4c4e	runservice: add run workspace cleaner Removes old workspace files (defaults to 7 days)	2019-09-17 09:40:23 +02:00
Simone Gotti	bfc42ef60e	runservice: fix get tasks to run Currently `advanceRunTasks` isn't deterministic and doesn't calculate the final state in one call. So could happen that `getTasksToRun` will select a task to be executed since its parent are finished (marked as skipped in advanceRunTasks) but the task isn't marked to be skipped (because advanceRunTasks has calculated this task before its parents). Currently fix this doing the same task selection logic done in `advanceRunTasks` and add a TODO to make `advanceRunTasks` be deterministic by processing tasks by their level (from level 0).	2019-08-30 15:59:25 +02:00
Simone Gotti	c1ff28ef9f	*: export clients and related types Export clients and related packages. The main rule is to not import internal packages from exported packages. The gateway client and related types are totally decoupled from the gateway service (not shared types between the client and the server). Instead the configstore and the runservice client currently share many types that are now exported (decoupling them will require that a lot of types must be duplicated and the need of functions to convert between them, this will be done in future when the APIs will be declared as stable).	2019-08-02 12:02:01 +02:00
Simone Gotti	d0c5621201	util: remove time.go The same function is already provided by pointer.go	2019-08-01 14:14:56 +02:00
Simone Gotti	b81ad4cd8c	runservice: fix/improve executor delete logic * Don't fail tasks inside the delete executor action, just delete the executor from etcd * The scheduler, when detecting a task without a related executor will mark the task as failed and correctly set end time of the task and its steps.	2019-07-29 12:06:15 +02:00
Simone Gotti	6f3798e8fe	*: use sleep timer in loops So we'll react instantly to a context cancel instead of waiting on time.Sleep returning.	2019-07-25 16:22:54 +02:00
Simone Gotti	940264e413	runservice: add lock around compatchangegroups just to avoid concurrency errors when multiple instances are running	2019-07-10 10:20:35 +02:00
Simone Gotti	11a2ff48d6	runservice: delete executor task early currently we are deleting the executor tasks only when all the run tasks log/archives were fetched. But it'll better to remove a single executor task when the task fetching is finished. This could also fix possible issues on k8s since we are scheduling tasks but the k8s scheduler may not schedule them if there aren't enough resources causing a scheduling deadlock since we won't remove finished pods because their related tasks are not removed and k8s cannot start new pods since it has no resources.	2019-07-08 16:03:14 +02:00
Simone Gotti	87a472aaaf	runservice: add CacheGroup field to runconfig The cache group fields defines under which cache group the run cache data will belong. This is needed/useful for some next changes: * Make cache correctly work for user direct runs. Since the user direct runs all belong to the same run group (the user id) all the use direct runs will share the same caches. To distinguish between the different caches we need to use something in addition to the user id (the local repo uuid generated by the direct run start command) * Share the cache between multiple projects	2019-07-03 15:16:37 +02:00
Simone Gotti	19793db0c2	runservice: fix linter errors Fix errors reported by default golangci-lint linters	2019-07-02 14:53:01 +02:00
Simone Gotti	8d67844cc4	*: use vanity url use agola.io domain	2019-07-01 11:40:20 +02:00
Simone Gotti	5c911523c5	sentinel: skip executor that don't allow privileged containers if they are requested.	2019-06-13 18:32:56 +02:00
Simone Gotti	a53e14b4e8	runservice: check if executor is alive before scheduling tasks Check that the last update time is less than 1 minute (currently hardcoded)	2019-06-12 18:12:37 +02:00
Simone Gotti	9b2ce717c7	*: migrate to "golang.org/x/xerrors" Just a raw replace of "github.com/pkg/errors". Next steps will improve errors (like remote errors, api errors, not exist errors etc...) to leverage its functionalities	2019-05-23 11:23:14 +02:00
Simone Gotti	b3867fb7ca	objectstorage: add posix standard storage rename the previous posix storage to posixflat and make it currently not user selectable (since I'm not sure it's really worth using it). The new posix storage uses the filesystem without any escaping so it's not a real flat namespace. This isn't a real issue since also minio is not a flat namespace and we are so forced to use it like a hierarchycal filesystem.	2019-05-21 15:17:53 +02:00
Simone Gotti	b95fb98f3c	runservice: move RunEvent to types	2019-05-15 09:40:32 +02:00
Simone Gotti	bec9476d6c	runservice: store related runid with logs and archives Logs and archives can be shared by multiple runs. So removing a run doesn't imply that we could also remote the logs and archives since they could be "referenced" by another run. Store also the runids as specific objects along with the logs and archives so, we'll remove them only when no runids objects exist.	2019-05-08 12:11:46 +02:00
Simone Gotti	1e34dca95d	runservice: split and simplify scheduler and executor naming Also if they are logically part of the runservice the names runserviceExecutor and runserviceScheduler are long and quite confusing for an external user Simplify them separating both the code parts and updating the names: runserviceScheduler -> runservice runserviceExecutor -> executor	2019-05-07 23:56:10 +02:00

35 Commits