Most Big Data sources are unstructured. “Virtually no
analytics directly analyze unstructured data. Unstructured data may be an input
to an analytic process.” Franks gives the example of television shows like CSI
where direct matching of fingerprints and facial images is shown to happen
often. Fingerprint images cannot be compared directly since fingerprint imagesare unstructured data. Also, a high quality fingerprint image can be quite
large in size. So unlike on CSI, fingerprints are first analyzed and a set of
important points are identified on each print.
The points help create a graph.
It is the graphs from different fingerprints that are matched. The graph isfully structured and smaller in size. While unstructured prints are an input tothe process, the actual analysis to match them up does not use the unstructured
images but rather structured information extracted from them. The images
themselves could be stored on slower disk drives and could be used for
verification once a match is obtained, but the graphs are stored in databases
and are used for matching.

No comments:
Post a Comment