signature file

     

Signature files is a technique applie for document retrieval. The idea behind Signature files is to create a quick and dirty filter that will keep all the documents that match to the query and hopefully a few ones that do not. The way this is done is by creating for each file a signature, typically a hash coded version. One method is superimposed coding. A post-processing step is done to discard the false alarms. This structure since in most cases is inferior to inverted files in terms of speed, size and functionality, is not used much. However, with proper parameters it can beat the inverted files in certain environments.