A two-column data.frame is returned for id_type and organism. Organism is "unknown" for identifiers where there this can't be inferred (like Refseq).

If an identifier matches more than one id_type, the id_type is set to "ambiguous". If the identifier doesn't match any guesses, then "unknown".

guess_feature_type(x, with_organism = TRUE, summarize = TRUE)

Arguments

x

a character vector of ids

Value

data.frame with id (x) and id_type. If with_organism = TRUE, a third organism column is added with a guess for the organism.

Details

This was extracted from the guess_id_type in the DenaliSigDb package. We are using it as temporary bandaid to extract the "feature space" from results. These should be straightforward to extract when all analyses are going through some type of FacileDataStore.