package core_kernel
Install
Dune Dependency
Authors
Maintainers
Sources
sha256=e37370bad978cfb71fdaf2b1a25ab1506b98ef0b91e0dbd189ffd9d853245ce2
doc/core_kernel.pooled_hashtbl/Pooled_hashtbl/index.html
Module Pooled_hashtbl
Source
A polymorphic hashtbl that uses Pool
to avoid allocation.
This uses the standard linked-chain hashtable algorithm, albeit with links performed through a pool and hence avoiding caml_modify
(for table manipulation), even when hashing object keys/values.
This implementation is worth exploring for your application if profiling demonstrates that garbage collection and the caml_modify
write barrier are a significant part of your execution time.
include Core.Hashtbl_intf.Hashtbl
We provide a sexp_of_t
but not a t_of_sexp
for this type because one needs to be explicit about the hash and comparison functions used when creating a hashtable. Note that Hashtbl.Poly.t
does have [@@deriving sexp]
, and uses OCaml's built-in polymorphic comparison and and polymorphic hashing.
Creators
The module you pass to create
must have a type that is hashable, sexpable, and comparable.
Example:
Hashtbl.create (module Int);; - : (int, '_a) Hashtbl.t = <abstr>;;
val of_alist :
?growth_allowed:bool ->
?size:int ->
'a Base__.Hashtbl_intf.Key.t ->
('a * 'b) list ->
[ `Ok of ('a, 'b) t | `Duplicate_key of 'a ]
Example:
Hashtbl.of_alist (module Int) [(3, "something"); (2, "whatever")] - : [ `Duplicate_key of int | `Ok of (int, string) Hashtbl.t ] = `Ok <abstr>
val of_alist_report_all_dups :
?growth_allowed:bool ->
?size:int ->
'a Base__.Hashtbl_intf.Key.t ->
('a * 'b) list ->
[ `Ok of ('a, 'b) t | `Duplicate_keys of 'a list ]
Whereas of_alist
will report Duplicate_key
no matter how many dups there are in your list, of_alist_report_all_dups
will report each and every duplicate entry.
For example:
Hashtbl.of_alist (module Int) [(1, "foo"); (1, "bar"); (2, "foo"); (2, "bar")];; - : [ `Duplicate_key of int | `Ok of (int, string) Hashtbl.t ] = `Duplicate_key 1 Hashtbl.of_alist_report_all_dups (module Int) [(1, "foo"); (1, "bar"); (2, "foo"); (2, "bar")];; - : [ `Duplicate_keys of int list | `Ok of (int, string) Hashtbl.t ] = `Duplicate_keys [1; 2]
val of_alist_multi :
?growth_allowed:bool ->
?size:int ->
'a Base__.Hashtbl_intf.Key.t ->
('a * 'b) list ->
('a, 'b list) t
Creates a "multi" hashtable, i.e., a hashtable where each key points to a list potentially containing multiple values. So instead of short-circuiting with a `Duplicate_key
variant on duplicates, as in of_alist
, of_alist_multi
folds those values into a list for the given key:
let h = Hashtbl.of_alist_multi (module Int) [(1, "a"); (1, "b"); (2, "c"); (2, "d")];; val h : (int, string list) Hashtbl.t = <abstr> Hashtbl.find_exn h 1;; - : string list = ["b"; "a"]
val create_mapped :
?growth_allowed:bool ->
?size:int ->
'a Base__.Hashtbl_intf.Key.t ->
get_key:('r -> 'a) ->
get_data:('r -> 'b) ->
'r list ->
[ `Ok of ('a, 'b) t | `Duplicate_keys of 'a list ]
Applies the get_key
and get_data
functions to the 'r list
to create the initial keys and values, respectively, for the new hashtable.
create_mapped get_key get_data [x1;...;xn]
= of_alist [get_key x1, get_data x1; ...; get_key xn, get_data xn]
Example:
let h = Hashtbl.create_mapped (module Int) ~get_key:((fun x -> x)[@local]) ~get_data:((fun x -> x + 1)[@local]) [1; 2; 3];; val h : [ `Duplicate_keys of int list | `Ok of (int, int) Hashtbl.t ] = `Ok <abstr> let h = match h with | `Ok x -> x | `Duplicate_keys _ -> failwith "" in Hashtbl.find_exn h 1;; - : int = 2
val create_with_key :
?growth_allowed:bool ->
?size:int ->
'a Base__.Hashtbl_intf.Key.t ->
get_key:('r -> 'a) ->
'r list ->
[ `Ok of ('a, 'r) t | `Duplicate_keys of 'a list ]
create_with_key ~get_key [x1;...;xn]
= of_alist [get_key x1, x1; ...; get_key xn, xn]
val group :
?growth_allowed:bool ->
?size:int ->
'a Base__.Hashtbl_intf.Key.t ->
get_key:('r -> 'a) ->
get_data:('r -> 'b) ->
combine:('b -> 'b -> 'b) ->
'r list ->
('a, 'b) t
Like create_mapped
, applies the get_key
and get_data
functions to the 'r list
to create the initial keys and values, respectively, for the new hashtable -- and then, like add_multi
, folds together values belonging to the same keys. Here, though, the function used for the folding is given by combine
(instead of just being a cons
).
Example:
Hashtbl.group (module Int) ~get_key:((fun x -> x / 2)[@local]) ~get_data:((fun x -> x)[@local]) ~combine:((fun x y -> x * y)[@local]) [ 1; 2; 3; 4] |> Hashtbl.to_alist;; - : (int * int) list = [(2, 4); (1, 6); (0, 1)]
Accessors
Attempting to modify (set
, remove
, etc.) the hashtable during iteration (fold
, iter
, iter_keys
, iteri
) will raise an exception.
Iterates over both keys and values.
Example:
let h = Hashtbl.of_alist_exn (module Int) [(1, 4); (5, 6)] in Hashtbl.iteri h ~f:(fun ~key ~data -> print_endline (Printf.sprintf "%d-%d" key data));; 1-4 5-6 - : unit = ()
Choose an arbitrary key/value pair of a hash table. Returns None
if t
is empty.
The choice is deterministic. Calling choose
multiple times on the same table returns the same key/value pair, so long as the table is not mutated in between. Beyond determinism, no guarantees are made about how the choice is made. Expect bias toward certain hash values.
This hash bias can lead to degenerate performance in some cases, such as clearing a hash table using repeated choose
and remove
. At each iteration, finding the next element may have to scan farther from its initial hash value.
val choose_randomly :
?random_state:Base__.Random.State.t ->
('a, 'b) t ->
('a key * 'b) option
Chooses a random key/value pair of a hash table. Returns None
if t
is empty.
The choice is distributed uniformly across hash values, rather than across keys themselves. As a consequence, the closer the keys are to evenly spaced out in the table, the closer this function will be to a uniform choice of keys.
This function may be preferable to choose
when nondeterministic choice is acceptable, and bias toward certain hash values is undesirable.
Like choose_randomly
. Raises if t
is empty.
add
and add_exn
leave the table unchanged if the key was already present.
change t key ~f
changes t
's value for key
to be f (find t key)
.
update t key ~f
is change t key ~f:(fun o -> Some (f o))
.
update_and_return t key ~f
is update
, but returns the result of f o
.
map t f
returns a new table with values replaced by the result of applying f
to the current values.
Example:
let h = Hashtbl.of_alist_exn (module Int) [(1, 4); (5, 6)] in let h' = Hashtbl.map h ~f:((fun x -> x * 2)[@local]) in Hashtbl.to_alist h';; - : (int * int) list = [(5, 12); (1, 8)]
Like map
, but the function f
takes both key and data as arguments.
Returns a new table by filtering the given table's values by f
: the keys for which f
applied to the current value returns Some
are kept, and those for which it returns None
are discarded.
Example:
let h = Hashtbl.of_alist_exn (module Int) [(1, 4); (5, 6)] in Hashtbl.filter_map h ~f:((fun x -> if x > 5 then Some x else None)[@local]) |> Hashtbl.to_alist;; - : (int * int) list = [(5, 6)]
Like filter_map
, but the function f
takes both key and data as arguments.
val partition_map :
('a, 'b) t ->
f:('b -> ('c, 'd) Base__.Either.t) ->
('a, 'c) t * ('a, 'd) t
Returns new tables with bound values partitioned by f
applied to the bound values.
val partition_mapi :
('a, 'b) t ->
f:(key:'a key -> data:'b -> ('c, 'd) Base__.Either.t) ->
('a, 'c) t * ('a, 'd) t
Like partition_map
, but the function f
takes both key and data as arguments.
Returns a pair of tables (t1, t2)
, where t1
contains all the elements of the initial table which satisfy the predicate f
, and t2
contains the rest.
val partitioni_tf :
('a, 'b) t ->
f:(key:'a key -> data:'b -> bool) ->
('a, 'b) t * ('a, 'b) t
Like partition_tf
, but the function f
takes both key and data as arguments.
find_or_add t k ~default
returns the data associated with key k
if it is in the table t
, and otherwise assigns k
the value returned by default ()
.
Like find_or_add
but default
takes the key as an argument.
find t k
returns Some
(the current binding) of k
in t
, or None
if no such binding exists.
find_exn t k
returns the current binding of k
in t
, or raises Stdlib.Not_found
or Not_found_s
if no such binding exists.
val find_and_call :
('a, 'b) t ->
'a key ->
if_found:('b -> 'c) ->
if_not_found:('a key -> 'c) ->
'c
find_and_call t k ~if_found ~if_not_found
is equivalent to:
match find t k with Some v -> if_found v | None -> if_not_found k
except that it doesn't allocate the option.
val find_and_call1 :
('a, 'b) t ->
'a key ->
a:'d ->
if_found:('b -> 'd -> 'c) ->
if_not_found:('a key -> 'd -> 'c) ->
'c
Just like find_and_call
, but takes an extra argument which is passed to if_found
and if_not_found
, so that the client code can avoid allocating closures or using refs to pass this additional information. This function is only useful in code which tries to minimize heap allocation.
find_and_remove t k
returns Some (the current binding) of k in t and removes it, or None is no such binding exists.
val merge :
('k, 'a) t ->
('k, 'b) t ->
f:
(key:'k key ->
[ `Left of 'a | `Right of 'b | `Both of 'a * 'b ] ->
'c option) ->
('k, 'c) t
Merges two hashtables.
The result of merge f h1 h2
has as keys the set of all k
in the union of the sets of keys of h1
and h2
for which d(k)
is not None, where:
d(k) =
f ~key:k (`Left d1)
ifk
inh1
maps to d1, andh2
does not have data fork
;
f ~key:k (`Right d2)
ifk
inh2
maps to d2, andh1
does not have data fork
;
f ~key:k (`Both (d1, d2))
otherwise, wherek
inh1
maps tod1
andk
inh2
maps tod2
.
Each key k
is mapped to a single piece of data x
, where d(k) = Some x
.
Example:
let h1 = Hashtbl.of_alist_exn (module Int) [(1, 5); (2, 3232)] in let h2 = Hashtbl.of_alist_exn (module Int) [(1, 3)] in Hashtbl.merge h1 h2 ~f:(fun ~key:_ -> function | `Left x -> Some (`Left x) | `Right x -> Some (`Right x) | `Both (x, y) -> if x=y then None else Some (`Both (x,y)) ) |> Hashtbl.to_alist;; - : (int * [> `Both of int * int | `Left of int | `Right of int ]) list = [(2, `Left 3232); (1, `Both (5, 3))]
val merge_into :
src:('k, 'a) t ->
dst:('k, 'b) t ->
f:
(key:'k key ->
'a ->
'b option ->
'b Base__.Hashtbl_intf.Merge_into_action.t) ->
unit
Every key
in src
will be removed or set in dst
according to the return value of f
.
filter_inplace t ~f
removes all the elements from t
that don't satisfy f
.
map_inplace t ~f
applies f
to all elements in t
, transforming them in place.
filter_map_inplace
combines the effects of map_inplace
and filter_inplace
.
equal f t1 t2
and similar f t1 t2
both return true iff t1
and t2
have the same keys and for all keys k
, f (find_exn t1 k) (find_exn t2 k)
. equal
and similar
only differ in their types.
Returns the list of all (key, data) pairs for given hashtable.
remove_if_zero
's default is false
.
add_multi t ~key ~data
if key
is present in the table then cons data
on the list, otherwise add key
with a single element list.
remove_multi t key
updates the table, removing the head of the list bound to key
. If the list has only one element (or is empty) then the binding is removed.
find_multi t key
returns the empty list if key
is not present in the table, returns t
's values for key
otherwise.
val validate :
name:('a key -> Base.String.t) ->
'b Validate.check ->
('a, 'b) t Validate.check
module Hashable = Base.Hashable
module Merge_into_action = Base.Hashtbl.Merge_into_action
include For_deriving with type ('a, 'b) t := ('a, 'b) t
include Base.Hashtbl.For_deriving with type ('a, 'b) t := ('a, 'b) t
val m__t_sexp_grammar :
(module M_sexp_grammar with type t = 'k) ->
'v Sexplib0.Sexp_grammar.t ->
('k, 'v) t Sexplib0.Sexp_grammar.t
val quickcheck_generator_m__t :
(module M_quickcheck with type t = 'k) ->
'v Base_quickcheck.Generator.t ->
('k, 'v) t Core.Quickcheck.Generator.t
val quickcheck_observer_m__t :
(module M_quickcheck with type t = 'k) ->
'v Core.Quickcheck.Observer.t ->
('k, 'v) t Core.Quickcheck.Observer.t
val quickcheck_shrinker_m__t :
(module M_quickcheck with type t = 'k) ->
'v Core.Quickcheck.Shrinker.t ->
('k, 'v) t Core.Quickcheck.Shrinker.t
resize t size
ensures that t
can hold at least size
entries without resizing (again), provided that t
has growth enabled. This is useful for sizing global tables during application initialization, to avoid subsequent, expensive growth online. See Immediate.String.resize
, for example.
val on_grow :
before:(unit -> 'a) ->
after:('a -> old_capacity:int -> new_capacity:int -> unit) ->
unit
on_grow ~before ~after
allows you to connect higher level loggers to the point where these hashtbls grow. before
is called before the table grows, and after
after it. This permits you to e.g. measure the time elapsed between the two.
This is only meant for debugging and profiling, e.g. note that once a callback is installed, there is no way to remove it.