Author Archives: admin

Stereo vision algorithms for dense 3D reconstruction: introduction and recent developments

Stefano Mattoccia, of DEIS will held a lecture at MICC entitled “Stereo vision algorithms for dense 3D reconstruction: introduction and recent developments”.

The lecturer Stefano Mattoccia, DEIS, University of Bologna

The lecturer Stefano Mattoccia, DEIS, University of Bologna

The stereo vision enables the 3D reconstruction of scenes observed by two or more cameras. In this seminar, considering the case of dense 3D reconstructions, the main problems of stereo vision will be introduced and recent developments in this area will be examined with particular reference to algorithms that lend themselves to being mapped on devices with parallel processing capabilities (eg FPGA, GPU).

The MICC at the MatchMaking ICT Florence

The ICT / Robotics MatchMaking is a day of bilateral meetings between companies and university laboratories organized by CsaVRI – University of Florence. The event will be attended  by companies interested in meeting the associated laboratories at the University of Florence in the ICT and Robotics in order to enrich their knowledge and the emergence of collaboration in innovation projects.

The MICC will be at the MatchMaking ICT/Robotics in Florence

The event aims to facilitate meetings between companies and university laboratories in the ICT and Robotics fields. It will be a chance of meeting between the interested companies and research laboratories related to the University of Florence, in the ICT and Robotics. Each company can meet face to face with the members of the laboratories that are of interest for their business. Each meeting will last 30 minutes with the objective of capturing the potential for the emergence of collaborations on research projects.

ToscanaIN and MICC Workshop “The multisensorial design” at CNA Next Week

ToscanaIN with CNA contributes to the CNA NeXT Week: a week for ‘making the future’.

In 2012, CNA NeXT arrives at the International Handicrafts Trade Fair in Florence (21-29 April) and organizes a week full of innovation initiatives for young people on collaborative work, collective intelligence, digital training, digital crafts and new media.

The multisensorial design workshop will be held at the International Handicrafts Trade Fair in Florence

The multisensorial design workshop will be held at the International Handicrafts Trade Fair

ToscanaIN contributes to CNA NeXT Week with three workshops (free for those accessing the Fair – ticket € 4.00 on weekdays) that take place in the Hall “Making Future” of the Arena “Digital Makers”.

On Tuesday, April 24, from 5PM to 7PM it takes place “The multisensorial design: journey into the world of the 5 senses”, workshop by ToscanaIN in collaboration with MICC (Media Integration and Communication Center) of University of Florence.

The workshop investigates the impact that design has on our senses every day, bringing together the experiences of professionals who operate internationally who will describe the results of unexpected and innovative design and present some projects / products they designed.

Gianpaolo D’Amico of MICC coordinates the workshop.

Program and speakers

  • Colour Design by Di Stefano Giovacchini of Disegno Design
  • Sound Design – Sara Lenzi of Lorelei
  • Product-design – Dimitris Zoz of Atelierzoz
  • Interaction design – Lea Landucci of MICC

VIVIT. Vivi l’Italiano web portal

VIVIT is a three-years project led by Media Integration and Communication Center (MICC) and Accademia della Crusca, funded on government FIRB funding. As a part of this project, the VIVIT web portal has been developed by MICC in order to give visibility to culture-related contents that may appeal to second and third generation Italians living abroad.

Vivit web portal

Vivit web portal

The main aim of the VIVIT web portal is to provide people of italian origins with quality content related to the history of the nation and that of the language, together with learning materials for self-assessment and improvement of the viewer’s language proficiency.

The development of the VIVIT web portal has officially started in 2010, when the information architecture and content organization were first discussed. The VIVIT project stated that the web portal should give users and potential teachers ways to interact with each other and to produce and reorganize contents to be shown online to language and culture learners. Given these premises, it was decided to make use of a CMS (Content Management System), the possibility of user roles definition and interaction being part of its nature.

VIVIT is being developed on Drupal. Free and open-source PHP-based software, Drupal has come a long way over recent years in features development and is now considered one of the best CMS systems together with the well-known WordPress and Joomla. A large amount of user-contributed plugins (modules, in Drupal terms) and layout themes is available, since the development process itself is relatively simple and widely documented.

At this time, the architecture of the VIVIT portal is mostly complete: users may browse content, comment on it, bookmark pages and reorganize them from inside the platform (users with the role of teachers may also share these self-created content units with other users, to create their own learning path through the contents of the web portal); audio and video resources are available as well as learning materials that allow user interaction granted by the use of a custom jQuery plugin developed internally at MICC.

It is also possible, for users with enough rights, to semantically process and annotate (that is, assign resources that describe the content) texts inside the portal by using the named entities and topic extraction servlet Homer, also developed at MICC: the tagging possibility is part of Drupal core modules, while the text analysis feature is a combination of the contributed tagging module and a custom module written specifically for the VIVIT portal. The Homer servlet is a Java application based on GATE, a toolkit for a broad range of NLP (Natural Language Processing) tasks.

LIT. Lexicon of Italian Television search engine

LIT. Lexicon of Italian Television search engine

The VIVIT web portal gives access to additional resources related to the same cultural field: in particular LIT (Lexicon of Italian Television) and LIR (Lexicon of Italian Radio). The former, LIT, is a Java search engine that uses Lucene in order to index about 160 video excerpts from Italian TV programs of about 30 minutes each, chosen from the RAI video archive. LIT also offers a backend system where it is possible to stream the video sequences, synchronize the transcriptions with the audio-video sources, annotate the materials by means of customized taxonomies and furthermore add specific metadata. The latter, LIR, is a similar system that relies on an audio archive composed of radio segments from several Italian sources. Linguists are currently using LIT and LIR for computational linguistics based research.

LIR. Lexicon of Italian Radio backend

LIR. Lexicon of Italian Radio backend

A coarse-to-fine approach for fast deformable object detection

Marco Pedersoli will present a method that can dramatically accelerate object detection with part based models. The method is based on the observation that the cost of detection is likely to be dominated by the cost of matching each part to the image, and not by the cost of computing the optimal configuration of the parts as commonly assumed. Therefore accelerating detection requires minimizing the number of part-to-image comparisons.

Coarse-to-fine inference

A method for the fast inference of multi-resolution part based models. (a) example detections; (b) scores obtained by matching the lowest resolution part (root filter) at all image locations; (c) scores obtained by matching the intermediate resolution parts, only at location selected based on the response of the root part; (d) scores obtained by matching the high resolution parts, only at locations selected based on the intermediate resolution scores

To this end we propose a multiple-resolutions hierarchical part based model and a corresponding coarse-to-fine inference procedure that recursively eliminates from the search space unpromising part placements. The method yields a ten-fold speedup over the standard dynamic programming approach and is complementary to the cascade-of-parts approach. Compared to the latter, our
method does not have parameters to be determined empirically, which simplifies its use during the training of the model. Most importantly, the two techniques can be combined to obtain a very significant speedup, of two orders of magnitude in some cases.

We evaluate our method extensively on the PASCAL VOC and INRIA datasets, demonstrating a very high increase in the detection speed with little degradation of the accuracy.

Inauguration of Urban Innovation Park of the City of Florence

Next Thursday March 15 there will be the inauguration of the new Urban Park for Innovation, housed in the renovated former prison area, “The Murate”.

Le Murate

Le Murate

The project of the city council of Florence aims to create a technological district for the promotion of cultural heritage in the area of Florence.

In the Urban Park for Innovation new and already existing companies involved in humanities and technologies will have the opportunity to meet each other and to take advantage of various networking facilities, in order to promote an union between an innovative use of technology and culture.

The Media Integration and Communication Center is one of the centers that will be housed in the park. Its Director, Professor Alberto Del Bimbo will present the activities of center during the meeting.

Furthermore, during the day, the space assigned to the center will also host some installations dedicated to cultural heritage that will be set up in collaboration with Nicola Torpei, owner of one of the companies that will be hosted in the park and former employee at the MICC in the field of natural interaction.

Read more on Comune di Firenze website

MICC cluster presentation

Tiberio Uricchio presented today a short technical tutorial regarding the use of the cluster installed at MICC.

Tiberio Uricchio

Tiberio Uricchio

The tutorial presented the overall structure and setup of the system, the use of Sun Grid Engine and parallelization of Matlab scripts.

You can read the resentation below (in italian)

On automatic reading of handwriting (What the writing hand could tell to the reading eye)

Prof. Angelo Marcelli will present a principled approach to automatic cursive handwriting reading. The approach is based upon handwriting generation models, and according to them assumes that handwriting is a learned complex motoric task which is accomplished by sequencing simpler movement called stroke.

Automatic reading of cursive handwriting

Automatic reading of cursive handwriting

As learning proceeds in human, so does fluency, which results in producing similar sequence of strokes in correspondence of sequence of letters. Such invariants represents therefore the basic drawing units to which an interpretation can be associated. Reading is then achieved by detecting the invariants used to produce the word to be recognized, associating to them their interpretations, and eventually concatenating the interpretations along the ink of the word, without explicit segmentation in characters, as in case of analytical approaches, or using dictionary of possible words, as in case of holistic methods.

He will conclude with a list of key issues to be addressed while pursuing the proposed approach and the steps we have undertaken along this path.

Vehicles that Learn by Observing How We Drive: Multidisciplinary Explorations in Human-Centered Driver Assistance

Understanding driver behavior and ethnography surrounding the task of driving are essential in the development of human-centric driver assistance systems.

Laboratory for Intelligent and Safe Automobiles University of California at San Diego

Laboratory for Intelligent and Safe Automobiles University of California at San Diego

Novel instrumented vehicles are used for conducting experiments, where the rich contextual information about vehicle dynamics, surround and driver state are captured for careful, detailed ethnographic studies, as well as realistic data for developing algorithms to analyze multi sensory signals for active safety.

In this presentation, Prof. Mohan M. Trivedi will provide a systems- oriented framework for developing multimodal sensing, inferencing algorithms and human-vehicle interfaces for safer automobiles.

He will consider three main components of the system, driver, vehicle, and vehicle surround. He will discuss various issues and ideas for developing models for these main components as well as activities associated with the complex task of safe driving.

The presentation will include discussion of novel sensory systems and learning algorithms for capturing not only the dynamic surround information of the vehicle but also the state, intent and activity patterns of drivers.

He will also introduce a new type of visual display called “dynamic active display”. These displays present visual information to the driver where driving view and safety-critical visual icons are presented to the driver in a manner that minimizes deviation of her gaze direction without adding to unnecessary visual clutter.

These contributions support the practical promise of the “human-centric active safety” (HCAS) systems in enhancing the safety, comfort, and convenience.

For videos and publication list visit LISA

Developmental Agents for Vision

In this talk, Marco Gori introduce the notion of developmental agents, that are based on the theory of “learning from constraints” (see e.g. http://videolectures.net/marco_gori/).

Perceptual and logic constraints

Perceptual and logic constraints

It is claimed that in most interesting tasks, learning from constraints naturally leads to “deep architectures”, that emerge when following the developmental principle of focusing attention on “easy constraints”, at each stage. Interestingly, this suggests that stage-based learning, as discussed in developmental psychology, might not be primarily the outcome of biology, but it could be instead the consequence of optimization principles and complexity issues that hold regardless of the “body”.

In the second part of the talk, he gives insights on the adoption of the proposed framework in computer vision. The proposed functional approach leads naturally to develop different notions of features, the lower level of which are somehow related to classical SIFT. It is pointed out that the adoption of information-theoretic principles are at the basis of the feature generation either at low or high level of the vision computer hierarchy. The functions that are developed are inherently independent of roto-translations and do acquire scale invariance by the minimization of an appropriate entropy-based measure, that is also at the basis of the focus of attention.

Finally, he gives an overview of different constraints emerging at different layers of the hierarchy, and claim that the overall system is expected to work in any visual environment by acting continuously, with no separation between learning and scene interpretation.