SeLeCt - Structures, Learning and Cognition

Publications

Authors Title
M. Bresler, D. Průša, V. Hlaváč Online recognition of sketched arrow-connected diagrams
International Journal on Document Analysis and Recognition, Vol. 19 (3), 2016, pp. 253-267.
Abstract: We introduce a new, online, stroke-based recognition system for hand-drawn diagrams which belong to a group of documents with an explicit structure obvious to humans but only loosely defined from the machine point of view. We propose a model for recognition by selection of symbol candidates, based on evaluation of relations between candidates using a set of predicates. It is suitable for simpler structures where the relations are explicitly given by symbols, arrows in the case of diagrams. Knowledge of a specific diagram domain is used—the two domains are flowcharts and finite automata. Although the individual pipeline steps are tailored for these, the system can readily be adapted for other domains. Our entire diagram recognition pipeline is outlined. Its core parts are text/non-text separation, symbol segmentation, their classification and structural analysis. Individual parts have been published by the authors previously and so are described briefly and referenced. Thorough evaluation on benchmark databases shows the accuracy of the system reaches the state of the art and is ready for practical use. The paper brings several contributions: (a) the entire system and its state-of-the-art performance; (b) the methodology exploring document structure when it is loosely defined; (c) the thorough experimental evaluation; (d) the new annotated database for online sketched flowcharts and finite automata diagrams.
Links: BibTeX, DOI
M. Bresler, D. Průša, V. Hlaváč Recognizing Off-line Flowcharts by Reconstructing Strokes and Using On-line Recognition Techniques
In: Proceedings of the International Conference on Frontiers in Handwriting Recognition (ICFHR 2016), Shentzen, China, October 23-26, 2016, pp. 48-53.
Abstract: We experiment with off-line recognition of handwritten flowcharts based on strokes reconstruction and our state-of-the-art on-line diagram recognizer. A simple baseline algorithm for strokes reconstruction is presented and necessary modifications of the original recognizer are identified. We achieve very promising results on a flowcharts database created as an extension of our previously published on-line database.
Links: BibTeX, DOI
L. Krtek, F. Mráz Two-Dimensional Limited Context Restarting Automata
In: Fundamenta Informaticae, Vol. 148 (3-4), 2016, pp. 309-340.
Abstract: Motivated by possible machine learning of picture languages, we introduce a new two-dimensional automaton called two-dimensional limited context restarting automaton. Our model is a simplification of the two-dimensional restarting tiling automaton, from which it differs in that it does not require to scan input pictures in a fixed order. We show that the two-dimensional limited context restarting automaton is equally powerful as the two-dimensional sgraffito automaton. Moreover, the correctness preserving version of the new model, while still being nondeterministic, is equivalent to the deterministic sgraffito automaton. However, the property of being correctness preserving is not even semi-decidable for two-dimensional limited context restarting automata.
Links: BibTeX, DOI
F. Mráz, F. Otto, D. Průša Two-Dimensional Limited Context Restarting Automata
RAIRO - Theoretical Informatics and Applications, 2016, Cambridge University Press, publication ahead of print, preprint published online on December 7, 2016
Abstract: With the aid of homogeneous morphisms, we turn the deterministic two-dimensional two-way ordered restarting automaton and its extended variant into devices that compute transductions of pictures, and we study the resulting classes of transductions in detail.
Links: BibTeX, DOI
I. Mrázová, P. Zvirinský Extraction and Interpretation of Textual Data from Czech Insolvency Proceedings (Technical Report)
TR No. 2016/1/KTIML, December 2016, 11 p.
Abstract: Recently, the Czech Insolvency Register covers about 200 000 insolvency proceedings commenced since 2008. In order to better assess the real impact of indebtedness across the Czech society, the data about the creditors or the reasons for the debt might be of great value. Unfortunately, the vast majority of such information is contained only in scanned document copies attached to the insolvency proceedings. Our goal is thus to extract textual data from the scanned pdf-files and to find the wanted information in the obtained texts.
Links: Address requests for download to: I. Mrázová
F. Otto, F. Mráz Cyclically extended variants of Sgraffito and restarting automata for picture languages
In: H. Bordihn, R. Freund, B. Nagy, G. Vaszil (Eds.): Proceedings of the Eighth Workshop on Non-Classical Models of Automata and Applications (NCMA 2016), šsterreichische Computer Gesellschaft, vol. 321 of books@ocg.at, 2016, pp. 259-273.
Abstract: Here we introduce and study cyclic extensions of deterministic Sgraffito automata and of deterministic two-dimensional two-way ordered restarting automata for picture languages. Such a cyclically extended automaton can move in a single step from the last column (or row) of a picture to the first column (or row). For deterministic Sgraffito automata, we show that this cyclic extension does not increase the expressive power of the model, while for deterministic two-dimensional two-way restarting automata, the expressive power is strictly increased by allowing cyclic moves. In fact, for the latter automata, we take the number of allowed cyclic moves in any column or row as a parameter, and we show that already with a single cyclic move per column (or row) the deterministic two-dimensional extended two-way restarting automaton can be simulated. On the other hand, we show that two cyclic moves per column or row already give the same expressive power as any finite number of cyclic moves. Finally, we show that for one-row pictures (that is, strings), already one cyclic move suffices.
Links: BibTeX
F. Otto, F. Mráz Regulated variants of limited context restarting automata
Theoretical Compute Science, vol. 682, pp. 190-207, 2016.
Abstract: In the literature various types of restarting automata have been studied that are based on contextual rewriting. A word w is accepted by such an automaton if, starting from the initial configuration that corresponds to input w, the tape contents is reduced to a particular word within a finite number of applications of these contextual rewritings. Here we extend the limited context restarting automata by using additional global means to structure the reductions they execute. In fact, we study regulated lc-R-automata, for which a regular control language is used to restrict the set of admissible reduction sequences, and we propose random context conditions to restrict the place at which a transition of an lc-R-automaton can be applied.
Links: BibTeX, DOI
M. Plátek, K. Oliva Redukční analýza A-stromů s minimalistickými omezeními
In: Proceedings of the 16th ITAT Conference on Information Technologies - Applications and Theory, Tatranské Matliare, Slovakia, September 15-19, 2016, pp. 26-33.
Abstract: Tento příspěvek navazuje na náš lonský příspěvek na ITATu. Zpracovává novým způsobem redukční analýzu na A-stromech, které jsou formalizací stromů, zpracovaných metodikou pro analytickou rovinu Pražského závislostního korpusu (PDT). Redukční analýza A-stromů sestává z minimálních korektních redukcí, které používají pouze elementární operace delete a shift. Hlavním cílem je vyvinout formální prostředky, které by exaktně zachycovaly lingvisticky pozorované minimalistické vlastnosti jednotlivých parametrů stromové redukční analýzy stromů ve formátu PDT a dovolily následně realizovat podobná pozorování na různých přirozených, či umělých jazycích. Pomocí pozorování lingvistického typu upřesnujeme strukturálně-složitostní vlastnosti A-stromů se závislostmi a koordinacemi. Zvýraznujeme vlastnosti, kterými se závislosti a koordinace liší.
Links: BibTeX, Publication
D. Průša Complexity of Sets of Two-Dimensional Patterns
In: Y.-S. Han, K. Salomaa (Eds.): Proceedings of the 21st International Conference on Implementation and Application of Automata (CIAA 2016), LNCS, Vol. 9705, 2016, pp. 236-247.
Abstract: We study the two-dimensional pattern matching implemented using the two-dimensional on-line tessellation automaton, which is a restricted type of the cellular automaton able to simulate the Baker- Bird algorithm, proposed as the first algorithm for the two-dimensional pattern matching. We further explore capabilities of this automaton to carry out the matching task against an arbitrary set of equally sized patterns. To measure amount of resources needed to accomplish it, we introduce the pattern complexity of a picture language. We show that this complexity spreads in a wide range. It is demonstrated by giving examples, deriving general techniques and proving some lower bounds.
Links: BibTeX, DOI
D. Průša, K. Reinhardt Undecidability of the emptiness problem for context-free picture languages
Theoretical Computer Science, 2016, in print.
Abstract:
Links: BibTeX, DOI
J. van Leeuwen, J. Wiedermann Knowledge, Representation, and the Dynamics of Computation
In: G. Dodig-Crnkovic, R. Giovagnoli (Eds.): Representation and Reality: Humans, Animals and Machines. Studies in Applied Philosophy, Epistemology and Rational Ethics, Vol. 28, Springer, 2017.
Abstract: Cognitive processes are often modelled in computational terms. Can this still be done if only minimal assumptions are made about any sort of representation of reality? Is there a purely knowledge-based theory of computation that explains the key phenomena which are deemed to be computational in both living and artificial systems as understood today? We argue that this can be done by means of techniques inspired by the modelling of dynamical systems. In this setting, computations are defined as curves in suitable metaspaces and knowledge is generated by virtue of the operation of the underlying mechanism, whatever it is. Desirable properties like compositionality will be shown to fit naturally. The framework also enables one to formally characterize the computational behaviour of both knowledge generation and knowledge recognition. The approach may be used in identifying when processes or systems can be viewed as computational in general. Several further questions pertaining to the philosophy of computing are considered.
Links: BibTeX
J. Vojt Deep neural networks and their implementation
Diploma thesis, Faculty of Mathematics and Physics, Charles University, Prague, 2016, 96 p.
Abstract: Deep neural networks represent an effective and universal model capable of solving a wide variety of tasks. This thesis is focused on three different types of deep neural networks – the multilayer perceptron, the convolutional neural network, and the deep belief network. All of the discussed network models are implemented on parallel hardware, and thoroughly tested for various choices of the network architecture and its parameters. The implemented system is accompanied by a detailed documentation of the architectural decisions and proposed optimizations. The efficiency of the implemented framework is confirmed by the results of the performed tests. A significant part of this thesis represents also additional testing of other existing frameworks which support deep neural networks. This comparison indicates superior performance to the tested rival frameworks of multilayer perceptrons and convolutional neural networks. The deep belief network implementation performs slightly better for RBM layers with up to 1000 hidden neurons, but has a noticeably inferior performance for more robust RBM layers when compared to the tested rival framework.
Links: BibTeX, Thesis
D. Průša Non-Recursive Trade-offs between Two-Dimensional Automata and Grammars
Theoretical Computer Science, Vol. 610/A, January 2016, pp. 121–132.
Abstract: We study succinctness of descriptional systems for picture languages. Basic models of two-dimensional finite automata and generalizations of context-free grammars are considered. They include the four-way automaton of Blum and Hewitt, the two-dimensional online tessellation automaton of Inoue and Nakamura and the context-free Kolam grammar of Siromoney et al. We show that non-recursive trade-offs between the systems are very common. Basically, each separation result proving that one system describes a picture language which cannot be described by another system can usually be turned into a non-recursive trade-off result between the systems. These findings are strongly based on the ability of the systems to simulate Turing machines.
Links: BibTeX, DOI
J. Wiedermann, J. van Leeuwen Understanding and Controlling Artificial General Intelligent Systems
Submitted to The 10th AISB Symposium on Computing and Philosophy: Language, cognition, and computation, AISB Convention, University of Bath, UK, 19-21 April 2017 (in review).
Abstract: Artificial general intelligence (AGI) systems are advancing in all parts of our society. The potential of autonomous systems that surpass the capabilities of human intelligence has stirred debates everywhere. How should ‘super-intelligent’ AGI systems be viewed so they can be feasibly controlled? We approach this question based on the viewpoints of the epistemic philosophy of computation, which treats AGI systems as computational systems processing knowledge over some domain. Rather than considering their autonomous development based on ‘self-improving software’, as is customary in the literature about super-intelligence, we consider AGI systems as operating with ‘self-improving epistemic theories’ that automatically increase their understanding of the world around them. We outline a number of algorithmic principles by which the self-improving theories can be constructed. Then we concentrate on the problem of aligning the behavior of AGI systems with human values in order to make such systems safe. This issue arises concretely when one studies the social and ethical aspects of human-robot interaction in advanced AGI systems as they exist already today. Necessarily, the behavior of any such system will depend on its own world-view. We argue that there is no ‘universal’ AGI system whose values will always agree with the values of any other system. Finally, based on the principles of interactive proof systems, we design an architecture of AGI systems and an interactive scenario that will enable one to detect in their behavior deviations from the prescribed goals. The conclusions from our analysis of super-intelligent systems temper the over-optimistic expectations and over-pessimistic fears of singularity believers, by grounding the ideas on super-intelligent AGI systems in more realistic foundations.
Links: BibTeX
F. Mráz, F. Otto, and D. Průša Some classes of rational functions for pictures
RAIRO - Theoretical Informatics and Applications, vol. 50, no. 4, pp. 351-369, 2016.
Abstract: With the aid of homogeneous morphisms, we turn the deterministic two-dimensional two-way ordered restarting automaton and its extended variant into devices that compute transductions ofpictures, and we study the resulting classes of transductions in detail.
Links: BibTeX, DOI

Last updated: Sunday, 02/18/18