Fast variable selection for memetracker phrases time series prediction

Yoan Miche*, Tatiana Chistiakova, Anton Akusok, Amaury Lendasse, Rui Nian, Alberto Guilléhn

*Motsvarande författare för detta arbete

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKonferensbidragVetenskapligPeer review

Sammanfattning

This paper proposes a methodology using a fast variable selection as a modified version of the Forward-Backward algorithm. This methodology is adapted to the specificities of the data used: very small number of samples and high number of variables. Such data is generated using underlying dependencies and seasonality assumptions, from Meme phrases volume data. By the use of a resampling technique along with the proposed variable selection scheme, significant results are obtained, and the test Normalized Mean Square Error performances are improved. The results indicate that with the assumptions made on the data structure, variable selection is desirable. Also, the obtained information on the selected variables seem to cluster the time series in two very different classes: a set of approximately 600 series, which yield good NMSE, and seem to require very similar sets of variables for the prediction; and another set of 300 - 400 series, for which only the previous series value is of interest for the prediction. This first analysis clearly illustrates the future need to perform a more thorough analysis of the selected variables for each of the batch of series. Also, taking a close look at the possible dependences between the series inside a batch should give information as to why and how they are similar and have found themselves to be grouped under the same batch.

OriginalspråkEngelska
Titel på värdpublikation5th International Conference on PErvasive Technologies Related to Assistive Environments, PETRA 2012 - Conference Program
Utgivningsdatum01.12.2012
Artikelnummer47
ISBN (tryckt)9781450313001
DOI
StatusPublicerad - 01.12.2012
MoE-publikationstypA4 Artikel i en konferenspublikation
Evenemang5th International Conference on PErvasive Technologies Related to Assistive Environments, PETRA 2012 - Heraklion, Crete, Grekland
Varaktighet: 06.06.201208.06.2012

Fingeravtryck

Fördjupa i forskningsämnen för ”Fast variable selection for memetracker phrases time series prediction”. Tillsammans bildar de ett unikt fingeravtryck.

Citera det här