A Few Quick Thoughts On SiteImprove’s API

I had a chance to work with SiteImprove’s API recently, and wanted to capture my thoughts and impressions while they were still fresh in mind.

First, the documentation was above average. Once you’ve received your API key, you can literally “test drive” particular methods within the documentation, right below the area where that particular method is explained. You see everything, the response, the headers, etc. which comes in handy when you’re troubleshooting an issue.

Second, the “next page link” being included in the results when the amount of matching items exceeds the amount per page (defaults at 10, can be maxed out at 1000). Makes recursive requests dead simple– if the “next link” exists, drop it in as your url variable’s next value, and let your for loop continue. If the “next link” doesn’t exist, break out of your loop. And the “next link” already has the correct url attributes appended to it, so you don’t need to worry about updating it or dealing with page number and items per page in your code.

Ok, so what did I *not* like? SiteImprove uses header responses to convey information about your current rate limit (e.g. 50), how many requests you have remaining (e.g. 25) and the rate reset (e.g. 5 seconds). That’s right: integer, integer, string? Why they didn’t keep the last one an integer and opt for milliseconds (e.g. 5000) is beyond me. But if slicing off the text and converting the result to milliseconds for a sleep() interval is the worst thing I have to complain about, that’s pretty good overall.

Secret User Agent Man!

torn up fortune cookie message

I was creating a PHP helper class to leverage the NPS Data API recently, and stumbled across an odd situation where I could successfully retrieve the API’s JSON result in Google Chrome with the ModHeader extension— but PHP’s file_get_contents function would fail with a Server 500 error.

After several  Google searches, I stumbled across this post on Stack Overflow which got to the heart of the problem. It turns out PHP’s default user agent string for file_get_results is blank, and some web servers are configured to ignore requests with empty user agent strings. Once I added an Agent String in my PHP’s header request, it started to return results as expected.