Tuesday 20 November 2012

Research - the start of the detailed trawl

So, if you've been wondering what on earth I could do with information about 1162 research papers you're not alone. I've been wondering myself as well!

The short answer is that I'm going to read them!

Naturally, many of the papers themselves are only accessible through subscription websites, and so the initial trawl is to look at the abstracts of the papers. I'm doing four things with this:
1) I'm adding the text of the abstract to my database
2) The abstract also includes the location of the research for the paper, so I'm adding that as well (there's a side analysis I'd thought about for looking at research locations)
3) I'm giving all of the papers a "relevance" score, out of 10 - where a low number is that the paper isn't particularly relevant (perhaps HSP is mentioned when the paper is looking at something else) and a high number is where the paper is very relevant.
4) I'm adding tags to the papers so that I can search on those tags, perhaps in combination with the relevance to get a set of papers.

This is going to be a slow process. I've done 10 papers so far, so I'm not 1% through the list yet. Oh, and the search results now gives 1185 papers, so there's another ~25 papers which have been published since August, but I think I'll leave the compilation of those until next year. It might be that I add a few extra fields into the database to group the papers further, but I'll get a few more papers through the list first. To save you reading about this every few weeks and marvelling at the snails pace that I do this, I'll wait till I get a sensible way through and provide some highlight information, and perhaps once I've got through this data pass I'll dig out the most relevant papers and try to get the full papers.

No comments:

Post a Comment