SoleData

Welcome to the documentation for SoleData.

SoleData.BASE_FEATURE_FUNCTIONS_ALIASES
SoleLogics.IA_L
SoleData.AbstractCondition
SoleData.AbstractFeature
SoleData.AbstractFullMemoset
SoleData.AbstractLogiset
SoleData.AbstractMemoset
SoleData.AbstractModalLogiset
SoleData.AbstractOneStepMemoset
SoleData.AbstractScalarOneStepGlobalMemoset
SoleData.AbstractScalarOneStepRelationalMemoset
SoleData.AbstractUnivariateFeature
SoleData.Aggregator
SoleData.DimensionalDatasets.UniformFullDimensionalLogiset
SoleData.DimensionalDatasets.UniformFullDimensionalOneStepRelationalMemoset
SoleData.ExplicitBooleanModalLogiset
SoleData.ExplicitFeature
SoleData.ExplicitModalLogiset
SoleData.Feature
SoleData.FullMemoset
SoleData.FunctionalCondition
SoleData.MixedCondition
SoleData.MultiFormula
SoleData.MultiLogiset
SoleData.MultivariateFeature
SoleData.ObliqueScalarCondition
SoleData.PropositionalLogiset
SoleData.RangeScalarCondition
SoleData.ScalarCondition
SoleData.ScalarExistentialFormula
SoleData.ScalarFormula
SoleData.ScalarMetaCondition
SoleData.ScalarOneStepMemoset
SoleData.ScalarOneStepRelationalMemoset
SoleData.ScalarUniversalFormula
SoleData.SupportedLogiset
SoleData.TestOperator
SoleData.UnivariateFeature
SoleData.UnivariateNamedFeature
SoleData.UnivariateScalarAlphabet
SoleData.ValueCondition
SoleData.VarFeature
SoleData.VariableAvg
SoleData.VariableDistance
SoleData.VariableMax
SoleData.VariableMin
SoleData.VariableSoftMax
SoleData.VariableSoftMin
SoleData.VariableValue
SoleLogics.AbstractFrame
SoleLogics.AbstractWorld
SoleLogics.Atom
SoleLogics.Interval
SoleLogics.Interval2D
SoleData.apply_test_operator
SoleData.capacity
SoleData.checkcondition
SoleData.computefeature
SoleData.computeunivariatefeature
SoleData.features
SoleData.featvaltype
SoleData.featvalue
SoleData.initlogiset
SoleData.islogiseed
SoleData.ismultilogiseed
SoleData.minify
SoleData.modforms
SoleData.naturalgrouping
SoleData.nfeatures
SoleData.nmemoizedvalues
SoleData.parsecondition
SoleData.parsefeature
SoleData.parsefeature
SoleData.representatives
SoleData.scalaralphabet
SoleData.scalarlogiset
SoleData.variable_name
SoleLogics.accessibles
SoleLogics.alphabet
SoleLogics.check
SoleLogics.syntaxstring
SoleData.@scalarformula

Logical foundations

Here are some core concepts for symbolic artificial intelligence with propositional and modal logics.o

SoleLogics.Atom — Type

struct Atom{V} <: AbstractAtom
    value::V
end

Simplest atom implementation, wrapping a value.

source

SoleLogics.AbstractWorld — Type

abstract type AbstractWorld end

Abstract type for the nodes of an annotated accessibility graph (Kripke structure). This is used, for example, in modal logic, where the truth of formulas is relativized to worlds, that is, nodes of a graph.

Implementing

When implementing a new world type, the logical semantics should be defined via accessibles methods; refer to the help for accessibles.

source

SoleLogics.Interval — Type

struct Interval{T<:Real} <: GeometricalWorld
    x :: T
    y :: T
end

An interval in a 1-dimensional space, with coordinates of type T.

Examples

julia> SoleLogics.goeswithdim(SoleLogics.Interval(1,2),1)
true

julia> SoleLogics.goeswithdim(SoleLogics.Interval(1,2),2)
false

julia> collect(accessibles(SoleLogics.FullDimensionalFrame(5), Interval(1,2), SoleLogics.IA_L))
6-element Vector{Interval{Int64}}:
 (3−4)
 (3−5)
 (4−5)
 (3−6)
 (4−6)
 (5−6)

source

SoleLogics.Interval2D — Type

struct Interval2D{T<:Real} <: GeometricalWorld
    x :: Interval{T}
    y :: Interval{T}
end

A orthogonal rectangle in a 2-dimensional space, with coordinates of type T. This is the 2-dimensional Interval counterpart, that is, the combination of two orthogonal Intervals.

Examples

julia> SoleLogics.goeswithdim(SoleLogics.Interval2D((1,2),(3,4)),1)
false

julia> SoleLogics.goeswithdim(SoleLogics.Interval2D((1,2),(3,4)),2)
true

julia> collect(accessibles(SoleLogics.FullDimensionalFrame(5,5), Interval2D((2,3),(2,4)), SoleLogics.IA_LL))
3-element Vector{Interval2D{Int64}}:
 ((4−5)×(5−6))
 ((4−6)×(5−6))
 ((5−6)×(5−6))

source

SoleLogics.syntaxstring — Function

syntaxstring(s::Syntactical; kwargs...)::String

Return the string representation of any syntactic object (e.g., Formula, SyntaxTree, SyntaxToken, Atom, Truth, etc). Note that this representation may introduce redundant parentheses. kwargs can be used to specify how to display syntax tokens/trees under some specific conditions.

The following kwargs are currently supported:

function_notation = false::Bool: when set to true, it forces the use of function notation for binary operators (see here).
remove_redundant_parentheses = true::Bool: when set to false, it prints a syntaxstring where each syntactical element is wrapped in parentheses.
parenthesize_atoms = !remove_redundant_parentheses::Bool: when set to true, it forces the atoms (which are the leaves of a formula's tree structure) to be wrapped in parentheses.

Examples

julia> syntaxstring(parseformula("p∧q∧r∧s∧t"))
"p ∧ q ∧ r ∧ s ∧ t"

julia> syntaxstring(parseformula("p∧q∧r∧s∧t"), function_notation=true)
"∧(∧(∧(∧(p, q), r), s), t)"

julia> syntaxstring(parseformula("p∧q∧r∧s∧t"), remove_redundant_parentheses=false)
"((((p) ∧ (q)) ∧ (r)) ∧ (s)) ∧ (t)"

julia> syntaxstring(parseformula("p∧q∧r∧s∧t"), remove_redundant_parentheses=true, parenthesize_atoms=true)
"(p) ∧ (q) ∧ (r) ∧ (s) ∧ (t)"

julia> syntaxstring(parseformula("◊((p∧s)→q)"))
"◊((p ∧ s) → q)"

julia> syntaxstring(parseformula("◊((p∧s)→q)"); function_notation = true)
"◊(→(∧(p, s), q))"

Implementation

In the case of a syntax tree, syntaxstring is a recursive function that calls itself on the syntax children of each node. For a correct functioning, the syntaxstring must be defined (including the kwargs... part!) for every newly defined SyntaxToken (e.g., SyntaxLeafs, that is, Atoms and Truth values, and Operators), in a way that it produces a unique string representation, since Base.hash and Base.isequal, at least for SyntaxTrees, rely on it.

In particular, for the case of Atoms, the function calls itself on the wrapped value:

syntaxstring(a::Atom; kwargs...) = syntaxstring(value(a); kwargs...)

The syntaxstring for any value defaults to its string representation, but it can be defined by defining the appropriate syntaxstring method.

Warning

The syntaxstring for syntax tokens (e.g., atoms, operators) should not be prefixed/suffixed by whitespaces, as this may cause ambiguities upon parsing. For similar reasons, syntaxstrings should not contain parentheses ('(', ')'), and, when parsing in function notation, commas (',').

source

SoleLogics.IA_L — Constant

See IntervalRelation.

source

SoleLogics.AbstractFrame — Type

abstract type AbstractFrame{W<:AbstractWorld} end

Abstract type for an accessibility graph (Kripke frame), that gives the topology to Kripke structures. A frame can be queried for its set of vertices (also called worlds, see allworlds), and it can be browsed via its accessibility relation(s) (see accessibles). Refer to FullDimensionalFrame as an example.

source

SoleLogics.accessibles — Function

accessibles(fr::AbstractUniModalFrame{W}, w::W)::Worlds{W} where {W<:AbstractWorld}

Return the worlds in frame fr that are accessible from world w.

source

accessibles(
    fr::AbstractMultiModalFrame{W},
    w::W,
    r::AbstractRelation
) where {W<:AbstractWorld}

Return the worlds in frame fr that are accessible from world w via relation r.

Examples

julia> fr = SoleLogics.FullDimensionalFrame((10,), Interval{Int});

julia> typeof(accessibles(fr, Interval(2,5), IA_L))
Base.Generator{...}

julia> typeof(accessibles(fr, globalrel))
Base.Generator{...}

julia> @assert SoleLogics.nworlds(fr) == length(collect(accessibles(fr, globalrel)))

julia> typeof(accessibles(fr, Interval(2,5), identityrel))
Vector{Interval{Int64}}

julia> Interval(8,11) in collect(accessibles(fr, Interval(2,5), IA_L))
true

Implementation

Since accessibles always returns an iterator to worlds of the same type W, the current implementation of accessibles for multi-modal frames delegates the enumeration to a lower level _accessibles function, which returns an iterator to parameter tuples that are, then, fed to the world constructor the using IterTools generators, as in:

function accessibles(
    fr::AbstractMultiModalFrame{W},
    w::W,
    r::AbstractRelation,
) where {W<:AbstractWorld}
    IterTools.imap(W, _accessibles(fr, w, r))
end

As such, when defining new frames, worlds, and/or relations, one should provide new methods for _accessibles. For example:

_accessibles(fr::Full1DFrame, w::Interval{<:Integer}, ::_IA_A) = zip(Iterators.repeated(w.y), w.y+1:X(fr)+1)

This pattern is generally convenient; it can, however, be bypassed, although this requires defining two additional methods in order to resolve dispatch ambiguities. When defining a new frame type FR{W}, one can resolve the ambiguities and define a custom accessibles method by providing these three methods:

# access worlds through relation `r`
function accessibles(
    fr::FR{W},
    w::W,
    r::AbstractRelation,
) where {W<:AbstractWorld}
    ...
end

# access current world
function accessibles(
    fr::FR{W},
    w::W,
    r::IdentityRel,
) where {W<:AbstractWorld}
    [w]
end

# access all worlds
function accessibles(
    fr::FR{W},
    w::W,
    r::GlobalRel,
) where {W<:AbstractWorld}
    allworlds(fr)
end

In general, it should be true that collect(accessibles(fr, w, r)) isa AbstractWorlds{W}.

source

SoleData.minify — Function

minify(dataset::D1)::Tuple{D2,Function} where {D1,D2}

Return a minified version of a dataset, as well as a backmap for reverting to the original dataset. Dataset minification remaps each scalar values in the dataset to a new value such that the overall order of the values is preserved; the output dataset is smaller in size, since it relies on values of type UInt8, UInt16, UInt32, etc.

API

Ontop of the logical layer, we define features, conditions on features, logisets, and memosets.

SoleData.AbstractFeature — Type

abstract type AbstractFeature end

Abstract type for features of worlds of [Kripke structures](https://en.wikipedia.org/wiki/Kripkestructure(model_checking).

source

SoleData.parsefeature — Method

parsefeature(FT::Type{<:AbstractFeature}, expr::AbstractString; kwargs...)

Parse a feature of type FT from its syntaxstring representation. Depending on FT, specifying keyword arguments such as featvaltype::Type may be required or recommended.

Utilities

Logisets

SoleData.ExplicitFeature — Type

struct ExplicitFeature{T} <: AbstractFeature
    name::String
    featstruct
end

A feature encoded explicitly, for example, as a slice of DimensionalDatasets.UniformFullDimensionalLogiset's feature structure.

See also AbstractFeature.

source

SoleData.Feature — Type

struct Feature{A} <: AbstractFeature
    atom::A
end

A feature solely identified by an atom (e.g., a string with its name, a tuple of strings, etc.)

See also AbstractFeature.

source

SoleData.FunctionalCondition — Type

struct FunctionalCondition{FT<:AbstractFeature} <: AbstractCondition{FT}
    feature::FT
    f::FT
end

A condition which yields a truth value equal to the value of a function.

See also AbstractFeature.

source

SoleData.ValueCondition — Type

struct ValueCondition{FT<:AbstractFeature} <: AbstractCondition{FT}
    feature::FT
end

A condition which yields a truth value equal to the value of a feature.

See also AbstractFeature.

source

SoleData.ExplicitBooleanModalLogiset — Type

struct ExplicitBooleanModalLogiset{
    W<:AbstractWorld,
    FT<:AbstractFeature,
    FR<:AbstractFrame{W},
} <: AbstractModalLogiset{W,Bool,FT,FR}

    d :: Vector{Tuple{Dict{W,Vector{FT}},FR}}

end

A logiset where the features are boolean, and where each instance associates to each world the set of features with true.

Scalar Logisets

SoleData.MixedCondition — Type

A union type for all condition-inducing objects. An object of this type, coupled with a (e.g., dimensional) dataset will induce a set of conditions in scalarlogiset.

source

SoleData.@scalarformula — Macro

@scalarformula expr

Parse a logical formula on scalar conditions, such as V1 > 10. Note that logical operators take precedence over comparison operators, so it is often the case that expressions such as V1 > 10 must be wrapped in parentheses.

Examples

julia> φ = @scalarformula ((V1 > 10) ∧ (V2 < 0) ∧ (V2 < 0) ∧ (V2 <= 0)) ∨ ((V1 <= 0) ∧ ((V1 <= 3)) ∧ (V2 >= 2))
SyntaxBranch: (V1 > 10 ∧ V2 < 0 ∧ V2 < 0 ∧ V2 ≤ 0) ∨ (V1 ≤ 0 ∧ V1 ≤ 3 ∧ V2 ≥ 2)

source

SoleData.BASE_FEATURE_FUNCTIONS_ALIASES — Constant

Syntaxstring aliases for standard features, such as "min", "max", "avg".

source

SoleData.AbstractUnivariateFeature — Type

abstract type AbstractUnivariateFeature <: VarFeature end

A dimensional feature represented by the application of a function to a single variable of a dimensional channel. For example, it can wrap a scalar function computing how much red a Interval2D world, when interpreted on an image, contains.

source

SoleData.MultivariateFeature — Type

struct MultivariateFeature{U} <: VarFeature
    f::Function
end

A dimensional feature represented by the application of a function to a dimensional channel. For example, it can wrap a scalar function computing how much a Interval2D world, when interpreted on an image, resembles a horse. Note that the image has a number of spatial variables (3, for the case of RGB), and "resembling a horse" may require a computation involving all variables.

source

SoleData.UnivariateFeature — Type

struct UnivariateFeature{U,I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
    f::Function
    fname::Union{Nothing,String}
end

A dimensional feature represented by the application of a generic function f to a single variable of a dimensional channel. For example, it can wrap a scalar function computing how much red a Interval2D world, when interpreted on an image, contains. Optionally, a feature name fname can be attached to the function, which can be useful for inspection (e.g., if f is an anonymous function, this avoids names such s "#47" or "#49".

source

SoleData.UnivariateNamedFeature — Type

struct UnivariateNamedFeature{U,I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
    name::String
end

A univariate feature solely identified by its name and reference variable.

source

SoleData.VarFeature — Type

abstract type VarFeature <: AbstractFeature end

Abstract type for feature functions that can be computed on (multi)variate data. Instances of multivariate datasets have values for a number of variables, which can be used to define logical features.

For example, with dimensional data (e.g., multivariate time series, digital images and videos), features can be computed as the minimum value for a given variable on a specific interval/rectangle/cuboid (in general, a SoleLogics.GeometricalWorld).

As an example of a dimensional feature, consider min[V1], which computes the minimum for variable 1 for a given world. ScalarConditions such as min[V1] >= 10 can be, then, evaluated on worlds.

source

SoleData.VariableAvg — Type

struct VariableAvg{I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
end

Univariate feature computing the average value for a given variable.

source

SoleData.VariableDistance — Type

struct VariableDistance{I<:VariableId,T} <: AbstractUnivariateFeature
    i_variable::I
    reference::T
    distance::Function
    featurename::Union{String,Symbol}
end

Univariate feature computing a distance function for a given variable, with respect to a certain reference structure.

By default, distance is set to be Euclidean distance.

Examples

julia> vd = VariableDistance(1, [1,2,3,4]; featurename="StrictMonotonicAscending");

julia> syntaxstring(vd)
"StrictMonotonicAscending[V1]"

julia> computeunivariatefeature(vd, [1,2,3,4])
0.0

julia> computeunivariatefeature(vd, [2,3,4,5])
2.0

source

SoleData.VariableMax — Type

struct VariableMax{I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
end

Notable univariate feature computing the maximum value for a given variable.

source

SoleData.VariableMin — Type

struct VariableMin{I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
end

Notable univariate feature computing the minimum value for a given variable.

source

SoleData.VariableSoftMax — Type

struct VariableSoftMax{T<:AbstractFloat,I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
    alpha::T
end

Univariate feature computing a "softened" version of the maximum value for a given variable.

source

SoleData.VariableSoftMin — Type

struct VariableSoftMin{T<:AbstractFloat,I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
    alpha::T
end

Univariate feature computing a "softened" version of the minimum value for a given variable.

source

SoleData.VariableValue — Type

struct VariableValue{I<:VariableId} <: AbstractUnivariateFeature
    i_variable::I
end

A simple feature, equal the value of a scalar variable.

source

SoleData.computefeature — Method

computefeature(f::VarFeature, featchannel; kwargs...)

Compute a feature on a featchannel (i.e., world reading) of an instance.

See also VarFeature.

source

SoleData.parsefeature — Method

parsefeature(FT::Type{<:VarFeature}, expr::AbstractString; kwargs...)

Parse a VarFeature of type FT from its syntaxstring representation.

Keyword Arguments

featvaltype::Union{Nothing,Type} = nothing: the feature's featvaltype (recommended for some features, e.g., UnivariateFeature);
opening_parenthesis::String = "[": the string signaling the opening of an expression block (e.g., "min[V2]");
closing_parenthesis::String = "]": the string signaling the closing of an expression block (e.g., "min[V2]");
additional_feature_aliases = Dict{String,Base.Callable}(): A dictionary mapping strings to callables, useful when parsing custom-made, non-standard features. By default, features such as "avg" or "min" are provided for (see SoleData.BASE_FEATURE_FUNCTIONS_ALIASES); note that, in case of clashing strings, the provided additional aliases will override the standard ones;
variable_names_map::Union{Nothing,AbstractDict,AbstractVector} = nothing: mapping from variable name to variable index, useful when parsing from syntaxstrings with variable names (e.g., "min[Heart rate]");
variable_name_prefix::String = "V": prefix used with variable indices (e.g., "V10").

Note that at most one argument in variable_names_map and variable_name_prefix should be provided.

Note

The default parentheses, here, differ from those of SoleLogics.parseformula, since features are typically wrapped into Atoms, and parseformula does not allow parenthesis characters in atoms' syntaxstrings.

source

SoleData.variable_name — Method

variable_name(
    f::AbstractUnivariateFeature;
    variable_names_map::Union{Nothing,AbstractDict,AbstractVector} = nothing,
    variable_name_prefix::Union{Nothing,String} = "V",
)::String

Return the name of the variable targeted by a univariate feature. By default, an variable name is a number prefixed by "V"; however, variable_names_map or variable_name_prefix can be used to customize variable names. The prefix can be customized by specifying variable_name_prefix. Alternatively, a mapping from string to integer (either via a Dictionary or a Vector) can be passed as variable_names_map. Note that only one in variable_names_map and variable_name_prefix should be provided.

source

SoleData.Aggregator — Type

const Aggregator = Function

A test operator is a binary Julia Function used for comparing a feature value and a threshold. In a crisp (i.e., boolean, non-fuzzy) setting, the test operator returns a Boolean value, and <, >, ≥, ≤, !=, and == are typically used.

source

SoleData.TestOperator — Type

const TestOperator = Function

source

SoleData.apply_test_operator — Method

Apply a test operator by simply passing the feature value and threshold to the (binary) test operator function.

source

SoleData.ObliqueScalarCondition — Type

ObliqueScalarCondition(features, b, u, test_operator)

An oblique scalar condition (see oblique decision trees), such as $((features - b) ⋅ u) ≥ 0$, where features is a set of $m$ features, and $b,u ∈ ℝ^m$.

source

SoleData.RangeScalarCondition — Type

struct RangeScalarCondition{U<:Number,FT<:AbstractFeature} <: AbstractScalarCondition{FT}

A condition specifying a range of values for a scalar feature.

Fields:

feature: the scalar feature
minval, maxval: the minimum and maximum values of the range
minincluded, maxincluded: whether to include the minimum and maximum values in the range, respectively

The range is specified using interval notation, where the minimum value is included if minincluded is true and excluded if it is false. Similarly, the maximum value is included if maxincluded is true and excluded if it is false.

For example, if minincluded == true and maxincluded == false, the range is [minval, maxval).

The checkcondition method checks whether the value of the feature is within the specified range.

The syntaxstring method returns a string representation of the condition in the form feature ∈ [minval, maxval], where the interval notation is used to indicate whether the minimum and maximum values are included or excluded.

source

SoleData.ScalarCondition — Type

struct ScalarCondition{U,FT<:AbstractFeature,M<:ScalarMetaCondition{FT}} <: AbstractScalarCondition{FT}
    metacond::M
    a::U
end

A scalar condition comparing a computed feature value (see ScalarMetaCondition) and a threshold value a. It can be evaluated on a world of an instance of a logical dataset.

For example: $min[V1] ≥ 10$, which translates to "Within this world, the minimum of variable 1 is greater or equal than 10." In this case, the feature a VariableMin object.

source

SoleData.ScalarMetaCondition — Type

struct ScalarMetaCondition{FT<:AbstractFeature,O<:TestOperator} <: AbstractScalarCondition{FT}
    feature::FT
    test_operator::O
end

A metacondition representing a scalar comparison method. Here, the feature is a scalar function that can be computed on a world of an instance of a logical dataset. A test operator is a binary mathematical relation, comparing the computed feature value and an external threshold value (see ScalarCondition). A metacondition can also be used for representing the infinite set of conditions that arise with a free threshold (see UnboundedScalarAlphabet): ${min[V1] ≥ a, a ∈ ℝ}$.

source

SoleData.UnivariateScalarAlphabet — Type

struct UnivariateScalarAlphabet <: AbstractAlphabet{ScalarCondition}
    featcondition::Tuple{ScalarMetaCondition,Vector}
end

A finite alphabet of conditions, grouped by (a finite set of) metaconditions.

source

SoleData.ScalarExistentialFormula — Type

Templated formula for ⟨R⟩ f ⋈ t.

source

SoleData.ScalarFormula — Type

Abstract type for templated formulas on scalar conditions.

source

SoleData.ScalarUniversalFormula — Type

Templated formula for [R] f ⋈ t.

source

SoleData.featvalue — Method

featvalue(feature, logiseed, i_instance, w)

Return the value of a feature at world on an instance of a logiset.

See islogiseed.

source

SoleData.islogiseed — Method

islogiseed(dataset)::Bool

A logiseed is a dataset that can be converted to a logiset (e.g., via scalarlogiset). If the dataset is a unimodal logiseed, the following methods should be defined:

    islogiseed(::typeof(dataset)) = true
    initlogiset(dataset, features; kwargs...)
    ninstances(dataset)
    nvariables(dataset)
    frame(dataset, i_instance::Integer)
    featvalue(feature::VarFeature, dataset, i_instance::Integer, w::AbstractWorld)
    varnames(dataset)::Union{Nothing,Vector{<:Union{Integer, Symbol}}}
    vareltype(dataset, i_variable::Union{Integer, Symbol})

If dataset is a multimodal logiseed, the following methods should be defined, while its modalities (iterated via eachmodality) should provide the methods above:

    ismultilogiseed(::typeof(dataset)) = true
    nmodalities(logiseed)
    eachmodality(logiseed)

Examples

A DataFrame

julia> using DataFrames; df = DataFrame(rand(150, 4), :auto);


julia> SoleData.islogiseed(df)
true

julia> ninstances(df), nvariables(df)
(150, 4)

julia> SoleData.varnames(df)
4-element Vector{String}:
 "x1"
 "x2"
 "x3"
 "x4"

A Vector of multidimensional instances (i.e., instances that are Array{Number,N} with N ≥ 1, where the last dimension is that of variables)

julia> X = [rand(4) for i in 1:150];


julia> SoleData.islogiseed(X)
true

julia> ninstances(X), nvariables(X)
(150, 4)

julia> SoleData.varnames(X)
nothing

source

SoleData.ismultilogiseed — Method

ismultilogiseed(dataset)::Bool

See islogiseed.

source

SoleData.naturalgrouping — Method

naturalgrouping(
    X::AbstractDataFrame;
    allow_variable_drop = false,
)::AbstractVector{<:AbstractVector{<:Symbol}}

Return variables grouped by their logical nature; the nature of a variable is automatically derived from its type (e.g., Real, Vector{<:Real} or Matrix{<:Real}) and frame. All instances must have the same frame (e.g., channel size/number of worlds).

source

SoleData.scalaralphabet — Method

scalaralphabet(a::AbstractAlphabet{<:ScalarCondition}, args...; kwargs...)

TODO explain args and kwargs...

sorted: whether to sort the atoms in the sub-alphabets (i.e., the threshold domains), by a truer-first policy (default: true)
test_operators: test operators to use (defaulted to [≤, ≥] for real-valued features, and [(==), (≠)] for other features, e.g., categorical)

Return a MultivariateScalarAlphabet from an alphabet of ScalarCondition's.

source

SoleData.scalarlogiset — Function

scalarlogiset(dataset, features; kwargs...)

Convert a dataset structure (with variables) to a logiset with scalar-valued features. Refer to islogiseed for the interface that dataset must adhere to.

Arguments

dataset: the dataset that will be transformed into a logiset. It should adhere to the islogiseed interface;
features: vector of features, corresponding to dataset columns;

Keyword Arguments

use_onestep_memoization::Union{Bool,Type{<:AbstractOneStepMemoset}}=!isnothing(conditions) && !isnothing(relations):

enable one-step memoization, optimizing the checking of specific, short formulas using specific scalar conditions and relations (see AbstractOneStepMemoset);

conditions::Union{Nothing,AbstractVector{<:AbstractCondition}}=nothing:

a set of conditions or metaconditions to be used in one-step memoization. If not provided, metaconditions given by minimum and maximum applied to each variable will be used (see ScalarMetaCondition);

relations::Union{Nothing,AbstractVector{<:AbstractRelation}}=nothing:

a set of relations to be used in one-step memoization (see AbstractRelation);

onestep_precompute_globmemoset::Bool = (use_onestep_memoization != false):

precompute the memoization set for global one-step formulas. This usually takes little time: in facto, because, global formulas are grounded, the intermediate check result does not depend on the number of worlds.

onestep_precompute_relmemoset::Bool = false:

precompute the memoization set for global one-step formulas. This may take a long time, depending on the relations and the number of worlds; it is usually not needed.

use_full_memoization::Union{Bool,Type{<:Union{AbstractOneStepMemoset,AbstractFullMemoset}}}=true:

enable full memoization, where every intermediate check result is cached to avoid recomputing. This can be used in conjunction with one-step memoization;

print_progress::Bool = false: print a progress bar;
allow_propositional::Bool = false: allows a tabular (i.e, non-relational) dataset to be instantiated as a PropositionalLogiset, instead of a modal logiset;
force_i_variables::Bool = false: when conditions are to be inferred (conditions = nothing), force (meta)conditions to refer to variables by their integer index, instead of their Symbol name (when available through varnames, see islogiseed).

Logiseed-specific Keyword Arguments

worldtype_by_dim::AbstractDict{<:Integer,<:Type} = Dict([0 => OneWorld, 1 => Interval, 2 => Interval2D]):

When the dataset is a MultiData.AbstractDimensionalDataset, this map between the dimensionality and the desired AbstractWorld type is used to infer the frame type. By default, dimensional datasets of dimensionalities 0, 1 and 2 will generate logisets based on OneWorld, Interval's, and Interval2D's, respectively.

Examples

julia> df = DataFrame(A = [36, 37, 38], B = [1, 2, 3])
3×2 DataFrame
 Row │ A      B
     │ Int64  Int64
─────┼──────────────
   1 │    36      1
   2 │    37      2
   3 │    38      3

julia> scalarlogiset(df; worldtype_by_dim=([0=>OneWorld]))
SupportedLogiset with 1 support (2.21 KBs)
├ worldtype:                   OneWorld
├ featvaltype:                 Int64
├ featuretype:                 VariableValue
├ frametype:                   SoleLogics.FullDimensionalFrame{0, OneWorld}
├ # instances:                 3
├ usesfullmemo:                true
├[BASE] UniformFullDimensionalLogiset of dimensionality 0 (688.0 Bytes)
│ ├ size × eltype:              (3, 2) × Int64
│ └ features:                   2 -> VariableValue[V1, V2]
└[SUPPORT 1] FullMemoset (0 memoized values, 1.5 KBs))

julia> pointlogiset = scalarlogiset( Xdf; worldtypeby_dim=Dict([1 => SoleLogics.Point1D, 2 => SoleLogics.Point2D]) )

See also AbstractModalLogiset, AbstractOneStepMemoset, SoleLogics.AbstractRelation, SoleLogics.AbstractWorld, ScalarCondition, VarFeature.

source

SoleData.AbstractScalarOneStepGlobalMemoset — Type

Abstract type for one-step memoization structure for checking "global" formulas of type ⟨G⟩ (f ⋈ t). We refer to these structures as global memosets.

source

SoleData.ScalarOneStepRelationalMemoset — Type

A generic, one-step memoization structure used for checking specific formulas of scalar conditions on datasets with scalar features. The formulas are of type ⟨R⟩ (f ⋈ t)

source

SoleData.PropositionalLogiset — Type

PropositionalLogiset(table) <: AbstractPropositionalLogiset

A logiset of propositional interpretations, wrapping a Tables' table of real/string/categorical values.

Examples

This structure can be used to check propositional formulas:

using SoleData, MLJBase

X = PropositionalLogiset(MLJBase.load_iris())

φ = parseformula(
    "sepal_length > 5.8 ∧ sepal_width < 3.0 ∨ target == "setosa"";
    atom_parser = a->Atom(parsecondition(SoleData.ScalarCondition, a; featuretype = SoleData.VariableValue))
)

check(φ, X, 10) # Check the formula on a single instance

satmask = check(φ, X) # Check the formula on the whole dataset

slicedataset(X, satmask)
slicedataset(X, (!).(satmask))

source

SoleLogics.alphabet — Function

alphabet(
    X::PropositionalLogiset,
    sorted=true;
    test_operators::Union{Nothing,AbstractVector{<:TestOperator},Base.Callable}=nothing,
    discretizedomain=false,
    y::Union{Nothing, AbstractVector}=nothing,
)::MultivariateScalarAlphabet

Constructs an alphabet based on the provided PropositionalLogiset X, with optional parameters:

sorted: whether to sort the atoms in the sub-alphabets (i.e., the threshold domains), by a truer-first policy (default: true)
test_operators: test operators to use (defaulted to [≤, ≥] for real-valued features, and [(==), (≠)] for other features, e.g., categorical)
discretizedomain: whether to discretize the domain (default: false)
y: vector used for discretization (required if discretizedomain is true)

Returns a UnionAlphabet containing ScalarCondition and UnivariateScalarAlphabet.

source

Scalar Dimensional Logisets

SoleData.DimensionalDatasets.UniformFullDimensionalLogiset — Type

struct UniformFullDimensionalLogiset{
    U,
    W<:AbstractWorld,
    N,
    D<:AbstractArray{U},
    FT<:AbstractFeature,
    FR<:FullDimensionalFrame{N,W},
} <: AbstractUniformFullDimensionalLogiset{U,N,W,FT,FR}

Uniform scalar logiset with full dimensional frames of dimensionality N, storing values for each world in a ninstances × nfeatures array.

The size of the internal structure (or featstruct) depends on the (unique) world type considered.

Examples

Interval-based frames

With an interval-based, N-dimensional frame, the worlds are N-intervals, and have 2*N parameters, which are used to index an (N*2+2)-dimensional featstruct (recall that two dimensions are reserved for instances and features).

For example, consider the case of a 1-dimensional frame with three points: 1 2 3 ─────────────────────────

Given an instance and a feature, the featstruct will map the hyper-intervals across two dimensions: ┌───┬───────┬───────┬───────┐ │ │ 1 │ 2 │ 3 │ ├───┼───────┼───────┼───────┤ │ 1 │ [1,1] │ [1,2] │ [1,3] │ ├───┼───────┼───────┼───────┤ │ 2 │ │ [2,2] │ [2,3] │ ├───┼───────┼───────┼───────┤ │ 3 │ │ │ [3,3] │ └───┴───────┴───────┴───────┘

See also AbstractModalLogiset, AbstractUniformFullDimensionalLogiset, SoleLogics.FullDimensionalFrame.

source

SoleData.DimensionalDatasets.UniformFullDimensionalOneStepRelationalMemoset — Type

A relational memoset optimized for uniform scalar logisets with full dimensional frames of dimensionality N, storing values for each world in a ninstances × nmetaconditions × nrelations array. Each world is a hyper-interval, and its N*2 components are used to index different array dimensions, ultimately resulting in a (N*2+3)-dimensional array.

source

SoleData.initlogiset — Method

function initlogiset(
    dataset::AbstractDimensionalDataset,
    features::AbstractVector;
    worldtype_by_dim::Union{Nothing,AbstractDict{<:Integer,<:Type}}=nothing
)::UniformFullDimensionalLogiset

Given an AbstractDimensionalDataset, build a UniformFullDimensionalLogiset.

Keyword Arguments

worldtypebydim::Union{Nothing,AbstractDict{<:Integer,<:Type}}=nothing:

map between a dimensionality, as integer, and the AbstractWorld type associated; when unspecified, this is defaulted to Dict(0 => OneWorld, 1 => Interval, 2 => Interval2D).

See also AbstractDimensionalDataset, SoleLogics.AbstractWorld, MultiData.dimensionality, UniformFullDimensionalLogiset.

source

Multimodal Logisets

SoleData.MultiFormula — Type

struct MultiFormula{F<:Formula} <: AbstractSyntaxStructure
    modforms::Dict{Int,F}
end

A logical formula that can be checked on a MultiLogiset, representing the logical and between formulas across different modalities.

source

SoleData.MultiLogiset — Type

struct MultiLogiset{L<:AbstractLogiset}
    modalities  :: Vector{L}
end

A logical dataset composed of different modalities); this structure is useful for representing multimodal datasets in logical terms.

Optimizations

Representatives

SoleData.representatives — Method

representatives(
    fr::AbstractFrame{W},
    S::W,
    ::AbstractRelation,
    ::AbstractCondition
) where {W<:AbstractWorld}

Return an iterator to the (few) representative accessible worlds that are necessary for computing and propagating truth values through existential modal connectives. When this optimization is possible (e.g., when checking specific formulas on scalar conditions), it allows to further boost "one-step" optimizations (see AbstractOneStepMemoset).

For example, consider a Kripke structure with a 1-dimensional FullDimensionalFrame of length 100, and the problem of checking a formula "⟨L⟩(max[V1] ≥ 10)" on a SoleLogics.Interval SoleLogics.Interval{Int64}(1, 2) (with L being Allen's "Later" relation, see SoleLogics.IA_L). Comparing 10 with the (maximum) "max[V1]" computed on all worlds is the naïve strategy to check the formula. However, in this case, comparing 10 to the "max[V1]" computed on the single Interval SoleLogics.Interval{Int64}(2, 101) suffice to establish whether the structure satisfies the formula. Similar cases arise depending on the relation, feature and test operator (or, better, its aggregator).

Note that this method fallsback to accessibles.

source