Search in html source with GOOGLE? [closed] - search

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 1 year ago.
Improve this question
I have several websites, and I can't remember where I wrote some lines of code. As my pages are indexed by Google, I would like to know if Google offers a facility to search within the HTML source code/mark-up itself, instead of just allowing search within the visual, rendered, part of a page?
Thanks

I've come across the following resources on my travels (some already mentioned above):
HTML Mark-up-focused search engines
Nerdydata
I'd also like to throw in the following:
Huge, website crawl data archives
Common Crawl - 'years of free web page data to help change the world' (over 250TB+)
How can we analyze this crawl data?
For an idea of how to begin analyzing some of this massive data, take a look at Big Data/Map-reduce-type frameworks(s).
Google lists some ideas on using Apache's Spark project to analyze Common Crawl's dump(s). To understand the file format(s) used by Common Crawl, refer to the following:
So you’re ready to get started [with Common Crawl]
Navigating the WARC file format [by Common Crawl]
The article, Accessing-Common-Crawl-Dataset-on-S3, outlines accessing Common Crawl's 250TB+ dump(s) in a low cost manner without transferring that data load outside of Amazon's AWS/S3 network. Of course, that assumes you are going to use some combination AWS/EC2/S3 etc. to analyze the crawl data.
Finally, Patrick Durusau maintains some interesting Common-Crawl-usage-related blog pages.
Personally, I find this subject intriguing, I suggest we get this crawl data while it's HOT! ;-)

You can try PublicWWW for search in source/mark-up. It allows to find any HTML, JavaScript, CSS and plain text in web page source code on 167+ million websites.
With PublicWWW you can:
Find related websites through the unique HTML codes they share, i.e.
widgets & publisher IDs.
Identify sites using certain images or badges.
Find out who else is using your theme.
Identify sites mentioning you.
Find your competitor's affiliates.
Identify sites where your competitors personally collaborate or interact.
References to use a library or a platform.
Find code examples on the net.
Figure out who is using what JS widgets on their sites.
...
Of course you can find not only your websites which use some code/mark-up snippet.

Related

Archiving scenarios and revisiting stories [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about programming within the scope defined in the help center.
Closed 5 years ago.
Improve this question
Working on a large and complex application, I wonder where and whether we should be storing scenarios to document how the software works.
When we discuss problems with an existing feature, it's hard to see what we have already done and it would be hard to look back with a scrum tool such as TFS. We have some tests but these are not visible to the product owner. Should we be looking to pull out some vast story / scenario list, amending / updating as we go or is this not agile.
We have no record of how the software works other than the code, some unit tests,some test cases and a few out of date user guides.
We tend to use our automated acceptance tests to document this. As we work on a user story we also develop automated tests and this is part of our Definition of Done.
We use SpecFlow for the tests and these are written as Given, When, Then scenarios that are easy to read and understand and can be shared with the product owners.
These tests add a lot of value as they are our automated regression suite so regression testing is quicker and easier, but as they are constantly kept up to date as we develop new stories they also act as documentation of how the system works.
You might find it useful to have a look at a few blogs around Specification by Example which is essentially what we are trying to do.
A few links I found useful in the past are:
http://www.thoughtworks.com/insights/blog/specification-example
http://martinfowler.com/bliki/SpecificationByExample.html
Apart from the tests we used also a Wiki for documentation. Especially the REST API was documented with request/response examples but also other software behaviour (results from long discussions, difficult to remember stuff).
Since you want to be able to match a description of what you've done to the running software, then it sounds like you should put that in version control along with the software. Start with a docs/ directory, then add detail as you need it. I do this frequently, and it just works. If you want to make this web-servable, then set up a web server somewhere to check out the docs every so often and point the document root at the working copy docs/ directory.

Flowchart for the Web [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 8 years ago.
Improve this question
I am perfectly aware that "SO is not a place to ask for a tool" so I did my own search and did find a few tools, but, since I need a few specific features - I do believe that combined expertise and experience of SO users could be very useful to what I am looking for. Which is: a library to be able to build/display on a Web page a flow chart similar to this:
As I mentioned I've seen jsPlumb and a few others, but it takes too much time to research individual features of each, so I hope someone who has experience could suggest the best choice. The features I am looking for:
Browser support IE8+, Chrome, FF
Ability to add/edit/delete chart items on the fly either visually or programmaticaly
Ability to update backend (ASP.NET/SQL Server) with chart data
Ability to have flexible connectors (like from Button 3 to State 1 in the example above)
Ability to load chart data and have items auto-arrange into default/optimal layout
Possible library to have a non-free/commercial option (this is a stupid one, but this is what the business side is asking for, they're wary of free products)
I'll possible get quite a few downvotes for this question, but I am willing to chance it, hoping that someone could suggest a good flowcharting library that fits my needs.
As naughty as your question is, mxGraph is the only diagramming library with full IE8- support (down to IE 6). And that's with full functionality, try draw.io on IE 8.
1) Yes, and Opera, Safari, iOS 5+, Android 4+, Windows 8 touch, ChromeOS.
2) Yes
3) We have a full graph model implementation on the server in .NET, it's easier to integrate that a JavaScript only solution.
4) Yes
5) Yes
6) Yes
And yes, I work on the product.
Let's see how yFiles for HTML compares:
Browser support IE9+, Chrome, FF, (and more) (IE8 is a no-go, since HTML-5 is required and IE8 simply does not support that level, neither on the visualization, nor on the Javascript level).
There is a very rich API that can be used to programmatically change any aspect of the model and view. The default editing capabilities allow for all standard edit operations, custom ones can easily be added.
yFiles for HTML 1.1 comes as a pure client side application, so currently server connectivity needs to be added by the programmer. Since the library is server-agnostic it can be made to work with any server technology - samples included in the package.
Flexible connectors are in fact the default: Any number of bends and labels can be added to connectors, which connect to two ports. Both types of items can be configured and customized to a very high degree.
An outstanding feature of the library are the versatile automatic layout algorithms. In general it is not possible to get "optimal" layouts, but the (heuristic, at least most of them) algorithms are considered "best-of-breed".
The library is a commercial library, only.
Whether the library actually fits your needs depends on a lot of factors. Unless (1) is a blocker for you, I think yFiles might be a good candidate. You can try it by evaluating it from its homepage.
I recreated your sample in the online editor and used the following simple code configuration to create an automatic layout:
var ihl = new yfiles.hierarchic.IncrementalHierarchicLayouter()
ihl.automaticEdgeGrouping = true
ihl.backloopRouting = true
ihl.layoutOrientation = yfiles.layout.LayoutOrientation.LEFT_TO_RIGHT
ihl.integratedEdgeLabeling = true
// animate and apply the layout
graphControl.morphLayout(ihl, yfiles.system.TimeSpan.fromSeconds(1), null)
This is what I got:
Note: You did not ask for a plugin-free solution, but I answered your question as if you did. If plugins like Flash, Silverlight, or Applets are acceptable, then the sibling yFiles products for the respective platforms might also be an option. They work on all browsers that support those plugins, so even the oldest IE versions are supported.
Disclaimer: I work for the company who creates that library, however I do not represent my employer here at SO/SE. My thoughts, comments, and posts are my own.
GenMyModel is now supporting flowchart diagramming:
You could view and fork this sample here

What application help system (like chm files) exist on Linux/GTK? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
Improve this question
On windows CHM is a very good option.
Is there anything other then delivering a static set of HTML pages and using a primitive call to a webbrowser (which is even a problem itself on linux). And it would not offer any kind of fulltext searching, separated bookmarks and even the simple fact of not opening a new tab for each help call.
The Gnome yelp program is what is used for GTK/Gnome applications. It supports a number of formats, but not CHM directly. They have started to define their own markup, named Mallard. But I don't know what is the status of that.
I'd still recommend static HTML as the best option (and of course man pages!). For example you can use Sphinx to write beautiful documentation with a full-text search support!
There are CHM viewers available on Linux though frankly as a Linux user I'd prefer to get static HTML pages.
Some examples are chmsee and kchmviewer.
Afaik there is no universal system. Depending on your desktop system (gnome/kde) there might be helpsystems, but they are usually based on loose files and use full-blown browsers. (usually webkit based)
For Lazarus a CHM based helpsystem and embedded browser was created, including CHM write support.
The reasons to avoid loose static html were mostly:
the 60000 lemma static documentation took too long to install on lighter systems or systems with specialist filesystems.
CHM removes slack and adds compression.
we also support non posix and OS X systems, and little filesystem related problems (charsets/encoding, separators, path depth etc) and case insenstive filesystems on *nix caused a lot of grief. The CHM based help solved that, allowing for one set of routines to access helpdata on all systems.
indexing and toc are Btree based, and can be easily merged runtime from independently produced help sets. In general integrating independently produced helpfiles is a underappreciated aspect of helpfiles in general, while key to open platforms.
native fulltext search.
An own viewer also has the ability to take advantage of extra features on top of the base system.
I'm not mentioning the Lazarus system in the hope you adapt it, since it is at the moment too much a development system (SDK) oriented system, the viewer is not even available as a separate package. I mainly mention it to illustrate the problems of loose html.
I haven't investigate KDE/Gnome/Eclipse what they use as helpsystem for a while though. If I would have to restart from scratch, that's where I would look first.
If I had to create something myself quickly, I would use zipped static html, and a single gziped file with metadata/indexes and the lightest browser (Konquerer?) I could find. Not ideal, not like Windows, but apparently the best Linux can offer.

Programatically generate graphical (ADAPT) models / diagrams of structured data [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 8 years ago.
Improve this question
I am searching for strategies or other starting points to generate diagrams from structured data stored in a database. To make a simplified example that would be very similar to my requirements:
Input: DDL SQL to create tables with attributes in a Database.
Desired output: ERD diagram of the tables with connections to each other as svg.
The type of diagram I want to build is ADAPT but since I'm sure there is no implementation that directly generates ADAPT diagrams I'd be happy with anything that can generate models like BPMN, EPK or something similar. A generator for ERD might be a start but since ADAPT is more complex than ERD it would require heavy customization.
I currently create a prototype using Drupal so something in that area would be nice but I am glad with any input where I can learn about solving the core problem: How to automatically create graphical models out of structured data (without data about or user interaction in terms of element placement, element size etc.)?
Sidenotes:
svg is no hard requirement (could be png etc) but seems the best option to me.
The application I am building records requirenments that then should become the models.
The solution needs to be integratable into a web application. (have some sort of api)
Activiti could be a base for the diagram drawing but I cannot find information about diagramm creation via API instead of the UI.
Graphviz might be a starting point but I haven't seen any work beyond ERD.
This is for a student project so the budget is limited.
There is a number of diagramming libraries available and each have their strengths and weaknesses. It strongly depends on your exact requirements. I suggest you evaluate some of the alternatives and see what suits you best.
What I think: In a commercial scenario, the yFiles suite of libraries for various platforms is probably an option. From what I can see the HTML/JavaScript based solution should satisfy your requirements quite well: It can easily be told to fetch the data from various input sources, automatically create the structure and most importantly calculate a layout for the diagram so that it can be viewed immediately without having to be arranged by hand. Also you can customize the library to a very high degree: you can make the items look like the ones in "ADAPT", customize user interactivity so that only valid model changes are allowed (if you need interactive editing), and of course do all of this programmatically using a flexible Javascript API.
There is a nice example of that capability here:
Interactive Graph Source
Full disclosure: I work for yWorks, however on Stackoverflow I do not represent my employer.
You can generate Graphviz' DOT language and then run Graphviz' DOT.exe program by command line, which inputs a DOT language file and outputs an .svg file. This approach is pretty simple since you don't need to use Graphviz by API, and instead, just generate text file from whatever your programming environment is.
Not sure if this help you at all; perhaps just the general idea may be useful.
Going further, though, I've done this in an open source project, with the salient differences from what you're specifically describing above being that my project takes tabular (self-describing, structured) data instead of SQL DDL as its user input. Output is information in the form of concepts and relationships that can be used in several ways (searched, manipulated, diagrammed), one of which is to generate an .svg output showing various colored and shaped entities interconnected by variously colored and labeled arrows. The .svg output is done by translating concepts and relationships into an appropriate DOT file and then running Graphviz by command line, and then launching Firefox on the .svg output. Lots of differences from what you need, still, layout is handled purely by Graphviz.

PDF generation from XHTML in a LAMP environment [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 2 years ago.
Improve this question
Can anyone recommend a good server-side PDF generation tool that would work in a Linux environment. I want easy as possible, pass it a XHTML file (with images) and have it generate a PDF from the rendered source.
I don't have a massive budget, but anything under $1000 should be alright.
Andrew
I used HTMLDoc about 8 years ago and it did a good job of turning HTML tables with some basic formatting into a decent PDF report. There also seems to be an open source version as well.
I sounds like FPDF might be of help...
Also, the creation of PDF documents is called "PDF printing". I believe that might help you find other resources.
You might want to take a look at FOP, which stands for Formatting Objects Processor. It can generate PDF files on linux since it is Java based. From their site:
Apache FOP (Formatting Objects Processor) is a print formatter driven
by XSL formatting objects (XSL-FO) and an output independent formatter.
It is a Java application that reads a formatting object (FO) tree
and renders the resulting pages to a specified output. Output formats
currently supported include PDF, PS, PCL, AFP, XML (area tree
representation), Print, AWT and PNG, and to a lesser extent, RTF and
TXT. The primary output target is PDF.
You can find it here
I did some searching, what about tbookdtd?
It's downloadable here but it hasn't been active since 2005. It appears to convert the xml to Latex, into PDF.
Have you investigated PHP's documentation? There's also PHP FAQ with a few different links. PHP primarily supports PDFlib.
I have recently came across dompdf which I have used to convert pages created in HTML into PDF documents. It uses PHP5 (assuming using PHP does not bother you). This is also assuming that you don't want to statically create HTML files on the file system and then convert them using some kind of command-line tool?
One problem I found with dompdf is that you don't get a whole lot of configuration options natively, but it is open-source and doesn't seem to be too large, so you could probably jury-rig something up pretty easily.
If you do have a budget take a look at the following OpenEdge. I know that they did excatly what you want for us. A linux based PDF generation system.
I'd ask what they can do for you. Val Cassidy is the persons name.
BTW: I'm not getting anything for this and I don't even work for bespoke company anymore nor for OpenEdge ...
You could take a look at using OpenOffice via the OpenOffice API to load your XHTML document and export a PDF version. There is a bit of a learning curve to using the OpenOffice API but it is very powerful and can be run in server mode on systems without any graphical interface. It performs well - we've used it on some internal projects.

Resources