Title: TDR Targets
1The TDR Targets Database
Prioritizing potential drug targets in complete
genomes
2Prioritizing targets in whole genomes
- TDR Targets facilitates the prioritization of
targets in complete genomes by allowing users to
search for targets using defined criteria AND to
assign scores (weight) to these queries. - Although you can use TDR Targets in the same way
you use other genome databases (search, then
view), the full potential of the database is
exploited by working differently. - Search, Search, Search
- Assign scores to each search (weight queries)?
- Combine all weighted queries to obtain a ranked
list of genes - This is the focus of this tutorial, and in the
following slides we will show you how to
prioritize a genome in this way.
3Prioritizing targets in the genome of M.
tuberculosis
- In the next slides, we will give you a tour of
TDR Targets, showing you how you can prioritize
potential drug targets in the genome of
Mycobacterium tuberculosis. - Remember that because we want to be able to
assign different scores to each of our search
criteria, we need to separate these criteria in
different searches.
4Our list of criteria
- Here is a list of criteria we will use to
prioritize targets - Target is an enzyme (potential for assayability,
good druggability precedents)? - Target has low molecular weight and no
transmembrane domains (higher chances of
producing soluble active recombinant proteins - Target has either a known 3D structure or a 3D
model. - Target is expressed in the dormant stage
(clinically relevant disease phase)? - Target is essential
- Target is absent in humans (or other mammals) and
present in other bacteria - Target has some precedent for druggability
5(No Transcript)
6Target is an enzyme
Our first search will look for enzymes as these
are usually good drug targets. For this, we
expand the corresponding section, and check the
corresponding box for Functional category.
2.
7Optionally name your queries
Before running your query there is an optional
step, in which you can give your query a
meaningful name. This is useful so that you can
later identify easily each of your queries in the
Query History page.
3.
This section appears at the bottom of the search
page.
This is how your query will appear in the site's
History.
8Target has low MW and no TM domains
We have already run our first search. We are not
interested in the results for now, so we procceed
to our second search. As before we should first
specify our species of interest, then our search
criteria.
1.
2.
9Target has a 3D structure
In our next query we search for targets that have
an experimental 3D structure. Note that in the
same search section we can also specify that the
target should have a 3D model. But because we
want to be able to score these two criteria
differently, we do separate searches in this case.
1.
2.
10Target has a 3D model
All the queries we've been running have been
accumulating in the Query History page. Remember
to give them meanignful names!
1.
2.
11Target is expressed in the dormant stage
For queries involving gene expression, we will
perform two separate searches, so that we can
give different scores to each later. In the first
query we will select genes in the top 80-100
expression rank, and in a second query, those in
the 60-80 rank.
1.
2.
12Target is essential
1.
2.
M. tuberculosis is one of the species for which
genome-wide knockout data is available.
13Target is absent in humans
... and present in other bacteria.
1.
2.
14Target has some precedent for druggability
The predicted druggability index (range 0-1) is a
combined index with many components, such as
availability of known druggable orthologs,
similarity of target vs known druggable targets,
structural conservation of binding sites, etc.
1.
2.
15That should be enough
- In total we have done 10 searches using different
criteria, which we think will allow us to find
good targets in the genome of M. tuberculosis. - It's now time to go to the history page, add
weights to each of these queries and combine them
to obtain a ranked list of genes
16(No Transcript)
17Let's think about this for a second ...
- It is important to understand why you have to
calculate the UNION of the selected queries to
produce a ranked list of genes. - By calculating the INTERSECTION of the selected
queries you will get only those genes that are
present in all queries. And therefore all genes
in this final list will have the same score (the
sum of all scores). Pretty boring huh? - When you calculate the UNION, tough, any gene
that is present in at least one of your queries
will end up in the final list. However, genes
that are present more than once will have their
scores added. Note also that the final list will
also include genes that were present in all the
queries (the same ones you'd get by calculating
the INTERSECTION). These genes (if any) will have
the maximal score and will be listed at the top.
18The ranked list of genes
After calculating the UNION of the selected
queries, a new query in your History page will
appear, containing the results of your
prioritization. In this case, the ranked list
contains 3892 genes (which is gt 95 of the genes
in the genome of M. tuberculosis). You can click
on Show parameters to expand the view and reveal
the weighting strategy used (useful if you later
decide to delete all previous queries).
19The ranked list of genes
These are the top targets in the prioritized
list, sorted by descending weight.
20That's all for now
- In this quick tour of the TDR Targets database,
we showed you how you can use the database as a
tool to prioritize targets in a genome, based on
a defined set of criteria. - There are other aspects of TDR Targets that we
didn't cover in this tutorial. For more quick
guides, please head to - http//tdrtargets.org/tutorials or
- http//slideshare.net/tdrtargets
- Some of the slideshows available are
- Introduction to the TDR Targets Database
- Target Surveys in TDR Targets
- Sharing information with others in TDR Targets