Grep Clojure code using clojure.spec regexes. Inspired by grape.
This tool allows you to find patterns in Clojure code. I use it as a research tool for sci, clj-kondo or Clojure tickets.
io.github.borkdude/grasp {:mvn/version "0.1.4"}
The grasp.api
namespace currently exposes:
(grasp path-or-paths spec)
: returns matched sexprs in path or paths for spec. Accept source file, directory, jar file or classpath as string as well as a collection of strings for passing multiple paths. In case of a directory, it will be scanned recursively for source files ending with.clj
,.cljs
or.cljc
.(grasp-string string spec)
: returns matched sexprs in string for spec.resolve-symbol
: returns the resolved symbol for a symbol, taking into account aliases and refers. You can also usersym
to create a spec that matches a fully-qualified, resolved symbol.unwrap
: see Finding keywords.cat
,or
,seq
,vec
: see Convenience macros.*
,?
,+
: aliases for(s/* any?)
, etc.
Very alpha. API will almost certainly change.
Assuming you have the following requires:
(require '[clojure.java.io :as io]
'[clojure.pprint :as pprint]
'[clojure.string :as str]
'[clojure.spec.alpha :as s]
'[grasp.api :as g])
Find reify
usage with more than one interface:
(def clojure-core (slurp (io/resource "clojure/core.clj")))
(s/def ::clause (s/cat :sym symbol? :lists (s/+ list?)))
(s/def ::reify
(s/cat :reify #{'reify}
:clauses (s/cat :clause ::clause :clauses (s/+ ::clause))))
(def matches (g/grasp-string clojure-core ::reify))
(doseq [m matches]
(prn (meta m))
(pprint/pprint m)
(println))
This outputs:
{:line 6974, :column 5, :end-line 6988, :end-column 56}
(reify
clojure.lang.IDeref
(deref [_] (deref-future fut))
clojure.lang.IBlockingDeref
(deref
[_ timeout-ms timeout-val]
(deref-future fut timeout-ms timeout-val))
...)
{:line 7107, :column 5, :end-line 7125, :end-column 16}
(reify
clojure.lang.IDeref
...)
(output abbreviated for readability)
Find all usages of clojure.set/difference
:
(defn table-row [sexpr]
(-> (meta sexpr)
(select-keys [:uri :line :column])
(assoc :sexpr sexpr)))
(->>
(g/grasp "/Users/borkdude/git/clojure/src"
;; Alt 1: using rsym:
(g/rsym 'clojure.set/difference)
;; Alt 2: do it manually:
#_(fn [sym]
(when (symbol? sym)
(= 'clojure.set/difference (g/resolve-symbol sym)))))
(map table-row)
pprint/print-table)
This outputs:
| :uri | :line | :column | :sexpr |
|--------------------------------------------------------------+-------+---------+----------------|
| file:/Users/borkdude/git/clojure/src/clj/clojure/set.clj | 49 | 7 | difference |
| file:/Users/borkdude/git/clojure/src/clj/clojure/set.clj | 62 | 14 | difference |
| file:/Users/borkdude/git/clojure/src/clj/clojure/set.clj | 172 | 2 | difference |
| file:/Users/borkdude/git/clojure/src/clj/clojure/data.clj | 112 | 19 | set/difference |
| file:/Users/borkdude/git/clojure/src/clj/clojure/data.clj | 113 | 19 | set/difference |
| file:/Users/borkdude/git/clojure/src/clj/clojure/reflect.clj | 107 | 37 | set/difference |
Find all calls to clojure.core/map
that take 1 argument:
(g/grasp-string "(comment (map identity))" (g/seq (g/rsym 'clojure.core/map) any?))
; => [(map identity)]
Grasp the entire classpath for usage of frequencies
:
(->> (g/grasp (System/getProperty "java.class.path") #{'frequencies})
(take 2)
(map (comp #(select-keys % [:uri :line]) meta)))
Output:
({:uri "file:/Users/borkdude/.gitlibs/libs/borkdude/sci/cb96d7fb2a37a7c21c78fc145948d6867c30936a/src/sci/impl/namespaces.cljc", :line 815}
{:uri "file:/Users/borkdude/.gitlibs/libs/borkdude/sci/cb96d7fb2a37a7c21c78fc145948d6867c30936a/src/sci/impl/namespaces.cljc", :line 815})
When searching for keywords you will run into the problem that they do not have
location information because they can't carry metadata. To solve this problem,
grasp lets you wrap non-metadata supporting forms in a container. Grasp exposes
the unwrap
function to get hold of the form, while you can access the location
of that form using the container's metadata. Say we would like to find all
occurrences of :my.cljs.app.subs/my-data
in this example:
/tmp/code.clj
:
(ns my.cljs.app.views
(:require [my.cljs.app.subs :as subs]
[re-frame.core :refer [subscribe]]))
(subscribe [::subs/my-data])
(subscribe [:my.cljs.app.subs/my-data])
We can find them like this:
(s/def ::subscription (fn [x] (= :my.cljs.app.subs/my-data (unwrap x))))
(def matches
(grasp "/tmp/code.clj" ::subscription {:wrap true}))
(run! prn (map meta matches))
Note that you explicitly have to provide :wrap true
to make grasp wrap
keywords.
The output:
{:line 5, :column 13, :end-line 5, :end-column 27, :uri "file:/tmp/code.clj"}
{:line 6, :column 13, :end-line 6, :end-column 38, :uri "file:/tmp/code.clj"}
Grasp supports a custom :keep-fn
, the function which decides whether to
collect a matched result. The default :keep-fn
is:
(defn default-keep-fn
[{:keys [spec expr uri]}]
(when (s/valid? spec expr)
(impl/with-uri expr uri)))
When a spec result is valid, then the URI is attached to the result's metadata and kept.
In a custom :keep-fn
you are able to call s/conform
and keep that result around:
(defn keep-fn [{:keys [spec expr uri]}]
(let [conformed (s/conform spec expr)]
(when-not (s/invalid? conformed)
{:var-name (grasp/resolve-symbol (second expr))
:expr expr
:uri uri})))
Now the result of g/grasp
will be a seq of maps instead of expressions and you
can do whatever you want with it.
Using the option :source true
, grasp will attach the source string as metadata
on parsed s-expressions. This can be used to match on things like function
literals like #(foo %)
or keywords like ::foo
. For example: we can grasp for
function literals that have more than one argument:
(s/def ::fn-literal
(fn [x] (and (seq? x) (= 'fn* (first x)) (> (count (second x)) 1)
(some-> x meta :source (str/starts-with? "#("))))))
(def match (first (g/grasp-string "#(+ % %2)" ::fn-literal {:source true})))
(prn [match (meta match)])
Output:
[(fn* [%1 %2] (+ %1 %2)) {:source "#(+ % %2)", :line 1, :column 1, :end-line 1, :end-column 10}]
More examples in examples.
Grasp exposes the cat
, seq
, vec
and or
convenience macros.
All of these macros support passing in a single quoted value for matching a
literal thing 'foo
for matching that symbol instead of
#{'foo}
. Additionally, they let you write specs without names for each parsed
item: (g/cat 'foo int?)
instead of (s/cat :s #{'foo} :i int?)
. The seq
and vec
macros are like the cat
macro but additionally check for seq?
and
vector?
respectively.
A CLI binary can be obtained from Github releases.
It can be invoked like this:
$ ./grasp ~/git/spec.alpha/src -e "(set-opts! {:wrap true}) (fn [k] (= :clojure.spec.alpha/invalid (unwrap k)))" | grep file | wc -l
68
The binary supports the following options:
-p, --path: path
-e, --expr: spec from expr
-f, --file: spec from file
The path and spec may also be provided without flags, like grasp <path> <spec>
. Use -
for grasping from stdin.
The evaluated code from -e
or -f
may return a spec (or spec keyword) or call
set-opts!
with a map that contains :spec
and other options. E.g.:
(require '[clojure.spec.alpha :as s])
(require '[grasp.api :as g])
(s/def ::spec (fn [x] (= :clojure.spec.alpha/invalid (g/unwrap x))))
(g/set-opts! {:spec ::spec :wrap true})
If nil
is returned from the evaluated code and set-opts!
wasn't called, the
CLI assumes that code will handle the results and no printing will be
done. These programs may call g/grasp
and pass g/*path*
which contains the
path that was passed to the CLI.
Full example:
fn_literal.clj
:
(require '[clojure.pprint :as pprint]
'[clojure.spec.alpha :as s]
'[clojure.string :as str]
'[grasp.api :as g])
(s/def ::fn-literal
(fn [x] (and (seq? x) (= 'fn* (first x)) (> (count (second x)) 1)
(some-> x meta :source (str/starts-with? "#(")))))
(let [matches (g/grasp g/*path* ::fn-literal {:source true})
rows (map (fn [match]
(let [m (meta match)]
{:source (:source m)
:match match}))
matches)]
(pprint/print-table rows))
$ grasp - fn_literal.clj <<< "#(foo %1 %2)"
| :uri | :line | :source | :match |
|-------+-------+--------------+---------------------------|
| stdin | 1 | #(foo %1 %2) | (fn* [%1 %2] (foo %1 %2)) |
The matched s-expressions can be conformed and then pattern-matched using libraries like meander.
Revisiting the ::reify
spec which finds reify usage with more than one
interface:
(s/def ::clause (s/cat :sym symbol? :lists (s/+ list?)))
(s/def ::reify
(s/cat :reify #{'reify}
:clauses (s/cat :clause ::clause :clauses (s/+ ::clause))))
(def clojure-core (slurp (io/resource "clojure/core.clj")))
(def matches (g/grasp-string clojure-core ::reify))
(def conformed (map #(s/conform ::reify %) matches))
(require '[matchete.core :as mc])
(def pattern
{:clauses
{:clause {:sym '!interface}
:clauses (mc/each {:sym '!interface})}})
(first (mc/matches pattern (first conformed)))
Returns:
{!interface [clojure.lang.IDeref clojure.lang.IBlockingDeref clojure.lang.IPending java.util.concurrent.Future]}
(require '[meander.epsilon :as m])
(m/find
(first conformed)
{:clauses {:clause {:sym !interface} :clauses [{:sym !interface} ...]}}
!interface)
Returns:
[clojure.lang.IDeref clojure.lang.IBlockingDeref clojure.lang.IPending java.util.concurrent.Future]
Run script/compile
to compile the grasp
binary using
GraalVM
Copyright © 2020 Michiel Borkent
Distributed under the EPL License. See LICENSE.