• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/34

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

34 Cards in this Set

  • Front
  • Back
what is the idea of inductive inference?
inferring a general rule from a set of examples
what is the idea identification in the limit?
guessing the correct rule after a certain number of examples
what is the idea of identification by enumeration?
exhaustively searching the space of rules by listing all possible rules then trying them one after another
what is the practical problem of identification by enumeration?
it usually has a search space exponential in the length of the description for the correct rule
what is the first thing that must be specified in order to define an inductive inference problem?
the class of rules being considered (usually a class of functions or languages (where a language is a set of words))

another way to think about this is a class of sets from which our set of examples might belong to
why do the class of rules need to be specified?
to prevent rules that cannot be learned e.g. in a sequence prediction problem where one must guess the nth term of a sequence given 1,...,n-1 terms

if the rule is the nth term is the nth guess +1 then this rule is unlearnable
what is an example of a rule classes where the sequence of data consists of integers?
the class of rules (or rule space) may be functions f:Z->Z where f(n) is a fixed polynomial with integer coefficients
what is an example of a rule class where the sequence of data consists of bit strings
the rule space might be the regular sets over the alphabet {0,1}
what is the second thing that must be specified in order to define an inductive inference problem?
the hypothesis space, i.e., a set of descriptions where each rule in the class has at least one description in the hypothesis space

in terms of the rule space being a class of sets, they hypothesis space can be thought of as machines/programs that compute a set in the rule class
what are examples of a hypothesis space when the rule class is regular sets over the alphabet {1,1}
regular expressions or context free grammars
what is an example of a hypothesis space when the rule class is functions f:z->z that are integer polynomials in a single variable
the hypothesis space is essentially the same as the rule class, but we could also presumably use some other representation of polynomial functions e.g. a neural network
what is the third thing that must be specified in order to define an inductive inference problem?
for each rule its set of examples and the sequences of examples that constitute admissible presentations of the rule
what is an admissible presentation of a rule?
a examples that hold true for the rule
what is the fourth thing that must be specified in order to define an inductive inference problem?
the class of inference methods
what is the fifth thing that must be specified in order to define an inductive inference problem?
criteria for successful inference
why can't learning from a program trace be reduced to learning input output pairs of some function?
because if one wants to formulate the steps of the trace as input/output pairs of the transition function there is a missing argument of control state
what is an acceptor?
a finite state machine
what is a canonical acceptor for a language L?
it is the acceptor with minimal number of states
what is the prefix tree acceptor of a set of positive examples from a language i.e. a subset?
an acceptor that accepts S
what is an example of using the organization or structure of a hypothesis space to do practical inductive inference where the domain is finite state acceptors?
if one is learning minimal finite state acceptors then starting from the canonical tree acceptor of a set of examples one can search through state machines that have merged states rather than enumerate through all finite state machines
how is organization of search space useful for practical inductive inference?
properties of the hypothesis space can be used to avoid search through enumeration

one hypothesis may give information on another hypothesis (e.g. due to similarity) and this means if one is wrong it may be possible to not have to look at the other
what is basic idea of subsumption and version spaces as applied to organizing a search space?
if the set that corresponds to a hypothesis does not contain a positive example of the true set then any refinement or more specific hypothesis will also not capture the example and thus does not need to be examined conversely if a negative example is captured by a hypothesis then generalizations of the hypothesis do not need to be examined
what is the version space algorithm?
maintain a set of the most general hypotheses that do not capture any negative examples

also maintain the most specific hypotheses that contain all the positive examples

as more examples are received the hypotheses compatible in both the general set and the specific set will shrink and these hypotheses are the desired ones
what is the idea of pruning by forbidden features when it comes to practical inductive inference
one can identify part of a hypothesis as the root cause of its failure on many examples and thus avoid examining any hypothesis that has this part
how was forbidden feature idea applied to searching the space of context-free grammars (CFGs)?
CFGs are enumerated in increasing order of complexity

starting from the first a grammar is tested against a sample, if the grammar fails a routine determines the first production in the grammar that can be changed to avoid failure

the search through the enumerated list of grammars skips over any following grammar that contains the production (grammars close together in the enumeration should be similar and so some will probably have this production and be skipped)
what is the idea of jumping to conclusions or constructive methods?
usually heuristic search
the opposite of pruning systematic search. basically using the data and information about the hypothesis space to jump to a hypothesis even if it means skipping over other possibly consistent hypotheses
what are the downsides of constructive algorithms?
they usually don't have good theoretical descriptions of the optimality of their solutions
what is the constructive algorithm of k-tails, used for inferring non-deterministic finite-state transducers?
a parameter k is used to heuristically determine if states in a machine should be the same, by seeing if their behavior on strings of length less than k is the same.

if states behave the same they are identified(collapsed/merged?) creating a new transducer
what are some inductive problems that have constructive algorithm solutions with guarantees?
learning context-free grammars from bracketed examples

inferring pattern languages

inferring k-reversible languages
what is the learning context free grammars from bracketed examples problem?
learning a grammar given examples of the language along with unlabeled parse trees i.e. trying to identify internal labels for the nodes in the example parse trees
what are pattern languages?
a pattern language is a set of strings that can be generated by a string containing constants and variables (a pattern)
what are k-reversible languages?
a proper subclass of regular languages where if two strings have a same middle part of length k then any other strings with a different suffix after the middle part are also in the language
how do the systems of Hardy and Shaw approach inference of LISP programs?
they start with a schema program and use data to fill in the details of the schema
what is the summers approach to program induction?
the approach uses a recursive schemata and fills it in by rewriting the output in terms of cars, cdrs, and cons applied to the input and then a recursive relation is searched for within these basic list expressions

he also has a method for introducing auxiliary variables to build more complicated functions