JSON wrangling with jq

Help 9 / 10

Summarize data

Maps and Selects

The issues list I was looking at has many low-quality issues in it. Let’s say I want to grab all the items that are labeled. This would let me skip all the drive-by fix-my-problem issues.

Unfortunately, it’s impossible to do this with the GitHub API unless you specify all the possible labels in your query. However, I can easily do this query on the command line by filtering our results with jq. However, to do so, I’m going to need a couple more jq functions.

My query so far looks like this:

jq '[ .[] | { title: .title, number: .number, labels: .labels | length } ]' issues.json

The first thing I can do is simplify it using map.

jq 'map({ title: .title, number: .number, labels: .labels | length })' issues.json

map(...) lets you unwrap an array, apply a filter and then rewrap the results back into an array. You can think of it as a shorthand for [ .[] | ... ] and it comes up quite a bit in my experience, so it’s worth it committing to memory.

I can combine that with a select statement that looks like this:

map(select(.labels > 0))

select is a built-in function that takes a boolean expression and only returns elements that match. It’s similar to the WHERE clause in a SQL statement or array filter in JavaScript.

Like map, I find select comes up quite a bit, so while you may have to come back to this article or google it the first few times you need it, with luck, it will start to stick to your memory after that.

Putting this all together looks like this:

jq 'map({ title: .title, number: .number, labels: .labels | length }) | \
map(select(.labels > 0))' issues.json

This uses three object indexes, two maps, two pipes, a length function, and a select predicate. But if you’ve followed along, this should all make sense. It’s all just composing together filters until you get the result you need.

← Previous Next →

Memory usage

Import

Export

Reset your sandbox

Help