Adding queued querier to knapsack, and OsqueryVersion to menu #1180

seejdev · 2023-05-04T21:27:57Z

This PR adds a new piece of common functionality to Knapsack - a querier for issuing queries to osquery and getting results in an asynchronous fashion.

Why?

We already have the Querier interface and concept scattered around the code base. Some places have reimplemented the same things like backoff/retries, and sleeping to wait for osquery to be ready. It seems logical to consolidate and streamline this functionality, and provide a simple API for doing this. Adding this to Knapsack enables this functionality to be easily accessed throughout the code base.

How?

Knapsack now conforms to the Querier interface and this dependency is injected into it, either as a concrete implementation or a mock. The implementation I've provided is a FIFO queuedQuerier which asynchronously processes queries as they are added, and invokes callbacks to provide errors/results to the original clients.
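To make the shape of this concrete, here is a minimal sketch of a FIFO querier with callbacks. It is not the code in this PR: `fifoQuerier`, `queryFunc`, `newFifoQuerier`, and `run` are hypothetical names, and the wake-up channel is one possible way to nudge the worker when new items arrive.

```go
package main

import (
	"errors"
	"sync"
)

// queryFunc is a hypothetical stand-in for the existing synchronous querier.
type queryFunc func(query string) ([]map[string]string, error)

// queueItem pairs a query with the callback that receives its outcome.
type queueItem struct {
	query    string
	callback func(result []map[string]string, err error)
}

// fifoQuerier is an illustrative sketch, not the PR's queuedQuerier.
type fifoQuerier struct {
	mu    sync.Mutex
	items []queueItem
	wake  chan struct{} // capacity 1: at most one pending wake-up
	exec  queryFunc
}

func newFifoQuerier(exec queryFunc) *fifoQuerier {
	return &fifoQuerier{wake: make(chan struct{}, 1), exec: exec}
}

// Query enqueues the item and returns immediately; it never waits on the worker.
func (f *fifoQuerier) Query(query string, callback func(result []map[string]string, err error)) error {
	if callback == nil {
		return errors.New("callback is required")
	}
	f.mu.Lock()
	f.items = append(f.items, queueItem{query: query, callback: callback})
	f.mu.Unlock()

	select {
	case f.wake <- struct{}{}:
	default: // a wake-up is already pending, nothing more to do
	}
	return nil
}

// run drains the queue in submission order until stop is closed.
func (f *fifoQuerier) run(stop <-chan struct{}) {
	for {
		select {
		case <-stop:
			return
		case <-f.wake:
		}
		for {
			f.mu.Lock()
			if len(f.items) == 0 {
				f.mu.Unlock()
				break
			}
			item := f.items[0]
			f.items = f.items[1:]
			f.mu.Unlock()

			// Run the query outside the lock so Query callers are never blocked by it.
			item.callback(f.exec(item.query))
		}
	}
}
```

A caller would start `run` in a goroutine once, then call `Query` from anywhere. Callbacks fire on the worker goroutine, so a slow callback delays every query queued behind it.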

Additionally, I've added OsqueryVersion as a menu template variable, since we already wanted that, and it provides a nice opportunity to use the queued querier in action. The menu is initially built with a default value for OsqueryVersion, and when the query result is returned, the menu is refreshed with the new data.
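As a rough illustration of the default-then-refresh flow, the sketch below queues the version query and rebuilds the menu from the callback. `menuData`, `queryFn`, `refreshOsqueryVersion`, and the `rebuild` hook are hypothetical names for this example; only the `osquery_info` table and its `version` column come from osquery itself.

```go
package main

import "sync"

// queryFn mirrors the callback-style Query signature proposed in this PR.
type queryFn func(query string, callback func(result []map[string]string, err error)) error

// menuData is a hypothetical holder for menu template variables.
type menuData struct {
	mu             sync.Mutex
	osqueryVersion string
}

func (m *menuData) setVersion(v string) {
	m.mu.Lock()
	defer m.mu.Unlock()
	m.osqueryVersion = v
}

func (m *menuData) version() string {
	m.mu.Lock()
	defer m.mu.Unlock()
	return m.osqueryVersion
}

// refreshOsqueryVersion queues the version query. When the callback fires it
// stores the result (or "Unknown" if the query failed or returned no rows)
// and asks the caller to rebuild the menu.
func refreshOsqueryVersion(query queryFn, m *menuData, rebuild func()) error {
	return query("SELECT version FROM osquery_info;", func(result []map[string]string, err error) {
		version := "Unknown"
		if err == nil && len(result) > 0 && result[0]["version"] != "" {
			version = result[0]["version"]
		}
		m.setVersion(version)
		rebuild()
	})
}
```

The mutex matters because the callback may run on the querier's goroutine while the menu is being rendered elsewhere.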

directionless

pretty neat. moving to callbacks is interesting, and worth thinking about.

Today, we have a synchronous interface. And many callers seem to expect that style. Some have their own backoff/retry mechanisms. I don't know if the callback style is a good fit for them, maybe it'll just take migration.

I think I like the callback though. So maybe we should expose Query and QueryCallback or something. And think about what needs a context and timeout?

I guess the other thing to consider, is how we want to handle things that query on an interval. Should the interval move in here? Something like QueryRepeatedly(callback, timer) but that feels messy. Maybe kick repeating forward.

directionless · 2023-05-04T22:49:05Z

pkg/agent/types/querier.go

+type Querier interface {
+	// Query will attempt to send a query to the osquery client. The result of the
+	// query will be passed to the callback function provided.
+	Query(query string, callback func(result []map[string]string, err error)) error


This is changing our common Query interface to being a callback. What do people think about that? Should we give it a different name?

Should we include context? Or some other mechanism for timeouts?

directionless · 2023-05-04T22:55:40Z

pkg/agent/knapsack/queued_querier.go

+	ctx, qq.cancel = context.WithCancel(ctx)
+	for {
+		if qq.syncQuerier != nil {
+			for {


Should this get some kind of healthcheck to make sure the underlying querier is actually alive?

Hmm maybe? If it does die, queries will fall on the floor and I assume an error will be returned quickly to the client.

Maybe just a health check in the initial startup. But I'm thinking about intent, I assume the purpose of the callback interface, is to allow various "get info" style things to get chucked into this queue on startup, before osquery finishes booting. From that perspective, we should health check here. We know the first couple queries may fail.

directionless · 2023-05-04T22:57:01Z

pkg/agent/knapsack/queued_querier.go

+	qq := &queuedQuerier{
+		logger:  log.With(logger, "component", "querier"),
+		queue:   q,
+		queueCh: make(chan struct{}, 10),


I arbitrarily chose 10, but I don't have a good feel for what's appropriate.

This would be the number of queries in the queue when Query invocations become blocking, and waiting for other queries to process. Maybe it should be larger just to provide plenty of room?

hmm ... I worry there is a chance that some other part of the code gets blocked while waiting on the channel to drain. Why don't we just leave it unbuffered and put the channel send in the Query func in a goroutine?

func (qq *queuedQuerier) Query(query string, callback func(result []map[string]string, err error)) error {
	if qq == nil {
		return errors.New("queuedQuerier is nil")
	}

	// Push the item on to the queue, and notify the channel so the query is processed
	qq.push(&queueItem{query: query, callback: callback})
	go func() {
		qq.queueCh <- struct{}{}
	}()

	return nil
}

Or we could set the buffer to 1 and check that len(qq.queueCh) == 0 before pushing

Reasonable! This should only really get used for the internal stuff, so order of 10 seems okay. Comment please?

Hmm I like @James-Pickett suggestion, it would be nice to guarantee that Query does not block.

I think James is probably correct. There's already the list there.

It makes me wonder if this channel is needed.

The channel is mostly to wake up the processing thread in cases where the queue is empty, or the querier is not yet available.

We could just spawn a goroutine for every query instead of queueing and processing serially... That would assume there is proper thread safety within the querier impls. If we spawn 100 goroutines all submitting queries, would that just work? That would also mean clients should not expect their queries to be run in order necessarily.

I was thinking the same ... effectively have a stack of blocking goroutines as an unordered queue. I don't think we care about order at this level.

If the caller wanted, they could force their own queries to be synchronous at their level, i.e. send query A ... wait for results ... send query B

seejdev · 2023-05-05T14:13:22Z

ee/desktop/runner/runner.go

@@ -721,3 +732,33 @@ func iconFilename() string {
 func (r *DesktopUsersProcessesRunner) iconFileLocation() string {
 	return filepath.Join(r.usersFilesRoot, iconFilename())
 }
+
+func (r *DesktopUsersProcessesRunner) refreshOsqueryData() {
+	osqueryVersion := "Unknown"


Right now, when desktop first starts, the menu item will display as osquery Version: . Upon the query result being returned it will change to osquery Version: <the version number> or osquery Version: Unknown in case of an error.

Is osquery Version: what we want for that waiting-for-osquery state? It could be something like osquery Version: Waiting for osquery... 🤷

I don't think it matters much. Unknown or Waiting...

directionless · 2023-12-22T03:30:17Z

Not going to merge this right now.
