Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing

Paper URL: http://dl.acm.org/citation.cfm?doid=3025453.3025912

Paper abstract: Datasets which are identical over a number of statistical properties, yet produce dissimilar graphs, are frequently used to illustrate the importance of graphical representations when exploring data. This paper presents a novel method for generating such datasets, along with several examples. Our technique varies from previous approaches in that new datasets are iteratively generated from a seed dataset through random perturbations of individual data points, and can be directed towards a desired outcome through a simulated annealing optimization strategy. Our method has the benefit of being agnostic to the particular statistical properties that are to remain constant between the datasets, and allows for control over the graphical appearance of resulting output.
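The loop the abstract describes (perturb one point, keep the change only if the summary statistics are unchanged, and let a cooling temperature decide whether to accept moves that do not improve the shape) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. The choice of statistics (means, standard deviations, Pearson correlation, compared at two decimal places), the circle used as the target shape, the linear cooling schedule, and every function and parameter name are assumptions made for the example.

```python
import math
import random
import statistics


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def summary(points, ndigits=2):
    """Statistics held constant, rounded to ndigits (an assumed choice)."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    stats = (
        statistics.fmean(xs),
        statistics.fmean(ys),
        statistics.stdev(xs),
        statistics.stdev(ys),
        pearson(xs, ys),
    )
    return tuple(round(s, ndigits) for s in stats)


def distance_to_circle(point, center=(50.0, 50.0), radius=30.0):
    """Hypothetical target shape: distance from a point to a circle outline."""
    return abs(math.hypot(point[0] - center[0], point[1] - center[1]) - radius)


def anneal(seed_points, steps=20000, max_temp=0.4, min_temp=0.01,
           shake=0.1, rng=None):
    """Move points toward the target shape while the rounded statistics stay fixed."""
    rng = rng or random.Random(0)
    points = list(seed_points)
    target = summary(points)
    for step in range(steps):
        # Linear cooling from max_temp down toward min_temp (assumed schedule).
        temp = max_temp - (max_temp - min_temp) * step / steps
        i = rng.randrange(len(points))
        old = points[i]
        new = (old[0] + rng.gauss(0, shake), old[1] + rng.gauss(0, shake))
        improves = distance_to_circle(new) < distance_to_circle(old)
        # Accept a worse move with probability equal to the temperature.
        if not (improves or rng.random() < temp):
            continue
        points[i] = new
        if summary(points) != target:
            points[i] = old  # the move changed the statistics, so undo it
    return points


if __name__ == "__main__":
    rng = random.Random(1)
    seed = [(rng.uniform(0, 100), rng.uniform(0, 100)) for _ in range(100)]
    result = anneal(seed, steps=5000)
    print(summary(seed))
    print(summary(result))
```

Because every accepted move is checked against the rounded statistics of the seed, the output always reports the same summary as the input. How closely it approaches the target shape depends on the number of steps and the perturbation size.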

Japanese summary:
