How can I loop through my “getElementById” VBA for multiple websites?


How can I loop through my “getElementById” VBA for multiple websites?



I'm part of a non-profit that sends letters to encourage hundreds of people in prison. They are often transferred unexpectedly, with no time to give notice of address change. However, each person's location while incarcerated is kept up-to-date and publicly accessible on the state government's website.



I am trying to write VBA that goes through my "contact" list and visits each state government's prisoner location website (based on each prisoner's ID), then extracts each person's location from the website, places that in a column ($C) for that purpose which corresponds to the row for that specific person's name & ID. That way I could run a check automatically to confirm each one is still at the same location before I do an Excel mailmerge to print envelope labels with their addresses.



Here's what I'm using to get the correct value (I've just been testing with a MsgBox CFTitle)


Dim IE As New InternetExplorer
IE.Visible = False
IE.navigate "http://mdocweb.state.mi.us/OTIS2/otis2profile.aspx?mdocNumber=" & Range("PrisonerID").Value
Do
DoEvents
Loop Until IE.readyState = READYSTATE_COMPLETE
Dim Doc As HTMLDocument
Set Doc = IE.document
Dim CFTitle As String
CFTitle = Trim(Doc.getElementById("valLocation").innerText)



And here is a screenshot of an example list of names (with actual Prisoner IDs), using the same columns as my list:
Example of Excel Contact Sheet




1 Answer
1



This is a quick way.



I read the prisoner ids into an array from a sheet (column K). If you read in from a sheet you get a 2D array and then loop the first dimension to get the ids.



I loop that array issuing a browserless XHR request for each id. This is a quick way to retrieve your information via GET request.


GET



I use .getElementById("valLocation") to get the correctional facility information.


.getElementById("valLocation")



I store these results in an array called facilities.


facilities



At the end I write the ids and locations out to the sheet, column C, with:


.Cells(2, 3).Resize(UBound(facilities) + 1, 1) = Application.WorksheetFunction.Transpose(facilities)



VBA:


Option Explicit
Public Sub GetInfo()
Dim sResponse As String, ids(), facilities(), i As Long, ws As Worksheet, counter As Long
Set ws = ThisWorkbook.Worksheets("Sheet1") '<==change as appropriate
ids = ws.Range("K2:K" & GetLastRow(ws)).Value
ReDim facilities(UBound(ids, 1) - 1)
Application.ScreenUpdating = False
On Error GoTo errhand
With CreateObject("MSXML2.XMLHTTP")
For i = LBound(ids, 1) To UBound(ids, 1)
counter = counter + 1
.Open "GET", "http://mdocweb.state.mi.us/OTIS2/otis2profile.aspx?mdocNumber=" & ids(i, 1), False
.send
sResponse = StrConv(.responseBody, vbUnicode)
sResponse = Mid$(sResponse, InStr(1, sResponse, "<!DOCTYPE "))

With CreateObject("htmlFile")
.Write sResponse
facilities(i - 1) = .getElementById("valLocation").innerText
End With
NextId:
Next i
End With
With ws
.Cells(2, 3).Resize(UBound(facilities) + 1, 1) = Application.WorksheetFunction.Transpose(facilities)
End With
Application.ScreenUpdating = True
Exit Sub

errhand:
Debug.Print counter
Debug.Print Err.Number & " " & Err.Description
Select Case Err.Number
Case 91
Err.Clear
facilities(i - 1) = "Not found"
GoTo NextId
End Select
Application.ScreenUpdating = True
End Sub



Result in the sheet:



Result in sheet





This worked splendidly on my test data, but when I ran it on my entire list of over 300 lines, I discovered it could only run on a varying number of entries (between 10-150), or it would give the "Run-time error '91' -- Object variable or With block variable not set." Alternately, sometimes it would give the same "Object variable or With block variable not set" without naming 91 specifically. When I debug, it goes to the line of " facilities(i - 1) = .getElementById("valLocation").innerText " I'm not seeing anything that should limit it being used on -- any thoughts? Thanks much!
– buildonrock
Jul 1 at 12:29






It may simply be too fast as a possibility assuming id valLocation is present for all of them. Can you verify when it falls over if this element id is present? Does it work fine if you process in batches? Let me know and I can advise further.
– QHarr
Jul 1 at 12:42





Potentially, if you provide me with a full list I can debug from this end. Also, some sites may not like large numbers of requests within a short span of time.
– QHarr
Jul 1 at 12:52






Thanks for the thoughts, @QHarr, I do wonder if it is something external since there is such variation between the number of IDs it can process. Sometimes it won't process 5, while once it processed 150 at once. Here's a long list to test
– buildonrock
Jul 2 at 12:23






Some who are discharged return "" for id valLocation -- but it has still worked for those, just nicely outputting "" into column C. I did find one erroneous ID (a typo), which stopped the code and gave a red warning box (can't remember what it said unfortunately, and now I can only get run-time error 91)
– buildonrock
Jul 2 at 12:29






By clicking "Post Your Answer", you acknowledge that you have read our updated terms of service, privacy policy and cookie policy, and that your continued use of the website is subject to these policies.

Popular posts from this blog

PySpark - SparkContext: Error initializing SparkContext File does not exist

django NoReverseMatch Exception

List of Kim Possible characters