When you go to the Freebase Data page, there is a list of categories. Drilling down into these, you can see the top-level domains and their types. Looking at all the types in a domain is easy enough, but I could not figure out a way to get the categories and top-level domains. The documentation was no help.

I finally found the answer by looking at the mjt source code at this site:
http://mjtemplate.org/examples/schemas/index.html

Here’s the query to retrieve the categories:

{
  “query” : [
    {
      "name" : null,
      "type" : "/freebase/domain_category"
    }
  ]
}

And here’s the query to retrieve the top-level domains for a category:
(Caution: This query will take a long time to run as it returns the top-level domains and their properties

{
  “query” : [
    {
      "domains" : [
        {
          "id" : null,
          "name" : null,
          "type" : "/type/domain",
          "types" : [
            {
              "/freebase/type_profile/instance_count" : null,
              "/type/reflect/any_reverse" : {
                "limit" : 1,
                "link" : {
                  "master_property" : "/freebase/type_hints/hinted_type"
                },
                "mediator" : null,
                "optional" : true,
                "type" : "/freebase/type_hints"
              },
              "id" : null,
              "name" : null,
              "optional" : true,
              "properties" : [],
              “sort” : “name”
            }
          ]
        }
      ],
      “name” : null,
      “type” : “/freebase/domain_category”
    }
  ]
}

I wish some of the types such as “/freebase/domain_category” or “/freebase/type_hints” were documented somewhere.

Can you imagine creating biological machines just as simply as writing code?

I am guessing your answer is no. Until today, I couldn’t either. Then I watched this amazing video of Drew Endy at the Chaos Communication Congress. It is totally cool stuff. Here’s a synopsis of the talk:

Genetic engineering is now a thirty year old technology. For
reference, over a similar period of time, modern computing machines
went from exclusive objects used to design weapons of mass destruction,
to the now ubiquitous panoply of personal computing devices that
support mass communication and construction. Inspired by this and many
other past examples of the overwhelmingly constructive uses of
technology by individuals, we have been working over the past five
years to develop new tools that will help to make biology easy to
engineer. We have also been working to foster a constructive culture of
future biological technologists, who can reliably and responsibly
conceive, develop, and deliver biological technologies that solve local
problems.

This talk will introduce current best practice in biological
engineering, including an overview of how to order
synthetic DNA and how to use and contribute standard biological parts
to an open source collection of genetic functions. The talk will also
discuss issues of human practice, including biological safety,
biological security, ownership, sharing, and innovation in
biotechnology, community organization, and perception across many
different publics. My hope is that the conferees of 24C3 will help me
to understand how to best enable an overwhelmingly constructive hacker
culture for programming DNA.

Watch the video, then head on over to the Registry of Standard Biological Parts. Prepare to be amazed by a Radio Shack-esque catalog of biological parts. Think tagging is only for Web 2.0 photos and videos? Think again…here’s a collection of biological tags for biological engineers.

One of annoyances on XP and Windows Server 2003 is the appearance delay of the “Send To” menu because it does insane things like look for the presence of a floppy drive. You can eliminate the delay by following the detailed instructions here.

Dennis McDonald tagged me in the “8 Things You Didn’t Know About Me” game so here goes:

  1. I love Chai and drink a lot of it (not the Starbucks kind…the regular Indian kind which is way better).

  2. I am a digitalmediaholic. I have over 6 terabytes of ripped movies and 100 GB of music.
  3. I am a dot com bubble survivor. I founded a venture-funded startup called iWidgets.com that created widgets for web pages. Unfortunately, the technology was six years ahead of its time.
  4. My un-Americanized name is Nikunj, pronounced nick-ooonj.
  5. I have never had any alcoholic beverage…none.
  6. I became a vegetarian in 2001…did it cold turkey (pun intended).
  7. I grew up in Bombay and came to the U.S. in 1987 for my undergrad education. Never went back, and became a U.S. citizen a few years ago.
  8. I enjoy the creative aspects of brand development, especially logo development, tag line creation and messaging.

Lots of blog buzz about Facebook shutting-down Scoble’s account for running a script against Facebook, thus violating the site’s ToS. It appears that an ill-conceived experimental Plaxo Pulse script that used screen-scraping to retrieve email addresses is the culprit. I empathize with Scoble and given the facts, also think Facebook was justified in suspending his account. This post, however, is about a bigger, related issue that the event highlights. (No, it’s not about who owns the data either.)

I am concerned about the general complacency and casual attitude that people generally have about Web 2.0 data security. In this case, an extremely tech savvy individual allowed a test script from another (supposedly) tech savvy company to be executed against production data. That’s insane, no?

Not quite…it’s no different than downloading a beta app from a website and allowing it to run on your desktop. You really have no idea how it’s going to fudge-up your machine and your data, but you probably do it anyway once you determine the risk is acceptable.

In both situations, it comes down to Trust, Ignorance and Recklessness or a combination thereof. While there are now adequate protections available for desktop apps so your data can be reasonably protected, the same is not true for Web 2.0 apps. It’s the Wild West out there. With the proliferation of API’s, widgets and mash-ups, you have no idea where your data is being stored, who has access to it and what apps that are accessing it are doing with it. None. And you don’t have a prayer of a chance of ever finding out.

We need standards not only for open, cross-site access to user data,
but we also need standards that provide for how the data is persisted
by different sites. I don’t have the time to read the ToS for each
site, and even if I did, I have no way of verifying that what they say
about the privacy and security of my data is actually what is happening
in the data center. With a high probability, I can assure you that it
is seldom as air-tight as the ToS legalese would have you believe.

DataPortability.org appears to be a good start for enabling access to ones data, but that is only one half of the equation. In a distributed online world, we need standards that provide transparency about how the data is being stored, verifiable means for ensuring that ToS are being adhered to by sites and auditability about when/by whom/how our data is accessed. We need bread crumbs associated with user data no matter where it is persisted and this information needs to be accessible to us in an easy, centralized location regardless of where the data is stored. Better yet, we need a standard means of encrypting our data across multiple sites. This is a tall order, and it is unlikely to happen anytime soon, but it will happen. There will be a standard because users will demand it and sites that don’t provide it will see their users leave en masse.

Today, it’s not a big deal for most people because their online information consists of photos and videos, but the more important stuff is still in silos. Our bank has some info, the credit card company has some and the travel company has some. How long before these silos also start having API’s? Wesabe is already doing it. Others will follow. Very soon, it’s going to be way more personal information than just pictures of your kid at the family picnic that are going to be strewn across the web.

We either have to give up most or all of our privacy or figure out a way to protect it as the distributed web evolves.

© 2012 TechBubble Suffusion theme by Sayontan Sinha