Oracle Big Data Handbook – part 2 reviewed

22 Tuesday Apr 2014

Posted by mp3monster in General, Oracle, Technology

Tags

adaptors, BDA, BerkeleyDB, Big Data, book, connectors, Hadoop, NoSQL, ODI, Oracle, Oracle Press, review, Sleepycat, sqoop

After an excellent start in Part 1 of Oracle Press’ Oracle Big Data Handbook (reviewed here), Part 2 moves on to look at Apache Hadoop, Oracle’s Big Data Appliance and Oracle’s NoSQL offerings.

Chapter 3 provides a brilliant overview of Hadoop and the ecosystem that has developed around it, addressing the divergent versions of MapReduce that led to the likes of YARN, and touching on how commercialised versions of Hadoop (such as Cloudera) have taken this forward.

Apache Hadoop ecosystem

The chapter moves on to describe the core solution components such as Node Managers, their relationship to the hardware, and the use of more commodity kit rather than nice expensive SAN technology.

Hadoop Structure

So now we have a good view of Hadoop (pretty much uncoloured by Oracle), which leads into the next chapter (chapter 4), which looks at why Oracle have taken the approach of an appliance (which could be seen as contrary to the previously stated adoption of commodity kit).

Oracle Big Data

So as you can see, Oracle has woven together a set of technologies into an Exadata based platform which would not only deal with Big Data analytics but ideally support other volume scenario needs, so you’re not adding another data silo. All of this fits with Oracle’s Engineered Solutions viewpoint. The book also explains the other factors involved in the BDA design – those of commercial considerations and value propositions in relation to its customer base – which is very refreshing to see (rather than rationalisation through technical arguments alone).

The book addresses the challenge of “why should I go to Oracle for big data?”, which is well argued on the basis of Oracle’s experience of very large relational deployments, its contributions to Hadoop via Cloudera and so on. The chapter finishes with the argument around cost, comparing the BDA with buying comparable hardware to build your own cluster. Taking just list prices compared to HP, the hardware costs come in more or less the same, and that’s before you account for the fact the Oracle price includes all the software.

Chapter 5 addresses the deployment of the BDA, explaining the configuration process, which with the combination of a tool called Mammoth (appropriate really) and the likes of Puppet seems pretty simple, as a lot of the solution is preconfigured on the box ready. All of this is reasonably well explained. My only grumble is that we do seem to revisit the details of the hardware fairly regularly; they are presented again here, although we go into a deeper dive on the configuration. One surprise that I’d not picked up on is that Oracle have made their NoSQL solution available as open source, although a little digging might explain why, as it has links back to Sleepycat’s BerkeleyDB, which Oracle acquired (more here). As the chapter moves through the physical aspects of the deployment it also highlights in clear terms any constraints Oracle imposes to ensure that the whole appliance is supportable; the most significant of these concerns the advanced networking that is set up.

As it moves through deployment considerations, Chapter 5 also addresses the means to know that the appliance is running properly, so we’re talking about monitoring not just the hardware but also the distributed nature of Hadoop and MapReduce. A brief view of the products deployed is given. Obviously this centres on the Enterprise Manager extensions, but it also covers component level tooling such as Cloudera Manager.

Chapter 6 in many respects continues building out the view of Hadoop, briefly describing the analytics tooling in the Oracle RDBMS, the R language and the data mining/discovery of Endeca. The interesting points in the chapter are about the relationship with the RDBMS, particularly as an enterprise data warehouse. This is something I’ve not really seen addressed elsewhere, as the common world view seems to put Hadoop in the same camp as NoSQL, which, when it comes to the RDBMS, seems to be gaining the zeal and polarity that Linux vs Windows used to have. But I think the book makes a good case for the right tool for the right job.

Oracle’s Strategic Product View

Chapter 7 starts to drill into what the connector package offers, which covers data transfer with the Oracle database, combining the R language with MapReduce, and ODI.

The database connectors aim to transfer data between Hadoop and the Oracle RDBMS more efficiently than, say, using Sqoop to move data to and from an Oracle database (ODI connectors, JDBC, direct OCI etc). To fully understand the explanation of how this works you do need to understand the basics of MapReduce, although the relevant MapReduce operations are elaborated upon as the chapter progresses. Later in the chapter we start being shown configuration fragments for the different connection approaches.
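For readers who want those MapReduce basics in one place, here is a minimal single-process sketch of the map, shuffle and reduce steps using the classic word count. It is my own illustration rather than anything from the book, it does not touch Hadoop or the Oracle connectors, and the function names are made up for the example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group the emitted values by key, the job a framework does between phases."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse all the values seen for one key into a single total."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
```

In a real cluster the map and reduce calls run on many nodes and the shuffle moves data across the network; the shape of the three steps is the part that carries over.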

The final chapter of this section of the book looks at the NoSQL database in detail, starting with high level ideas such as how NoSQL relates to ACID and BASE, then dropping down into significant (but valuable) detail by describing how clients are kept in sync through separate threads that pick up information about the data partitioning (sharding). Once the key components have been well described, the chapter moves on to explain how Oracle has optimized the process to make the NoSQL database as performant as possible whilst providing a solution that is elastic in nature and highly resilient but still predictable in its dynamics.
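To make the sharding idea a little more concrete, here is a rough sketch of a client holding its own copy of a partition map and having it refreshed. This is my own illustration and not the Oracle NoSQL client API; the hash function, the partition count, the shard names and the `ClientTopology` class are all assumptions made for the example.

```python
import hashlib
from typing import Dict

NUM_PARTITIONS = 12  # hypothetical value, assumed fixed for the life of the store


def partition_for(key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Hash a key to a stable partition number (MD5 is an assumption here)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


class ClientTopology:
    """Hypothetical client-side copy of the partition-to-shard map."""

    def __init__(self, partition_to_shard: Dict[int, str]) -> None:
        self._map = dict(partition_to_shard)

    def refresh(self, partition_to_shard: Dict[int, str]) -> None:
        # A real client would have a background thread call this when
        # partitions are moved between shards; here it is called by hand.
        self._map = dict(partition_to_shard)

    def shard_for(self, key: str) -> str:
        """Look up which shard currently owns the key's partition."""
        return self._map[partition_for(key)]
```

The point of the two levels is that a key always hashes to the same partition, so rebalancing only means moving partitions between shards and refreshing the map the clients hold.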

The chapter finishes off with considerations such as installation and how it integrates with Hadoop and OBIEE.

Overall, this is a very informative chapter. Occasionally it feels like some of the information is being repeated in a different structure, but that isn’t the end of the world, although if you’re reading from cover to cover you need to just press on.

Part 1 of the book is reviewed here.

Oracle Fusion Applications Development and Extensibility Handbook – Summary Review

14 Monday Apr 2014

Posted by mp3monster in Books, General, Oracle, Technology

Tags

applications, book, extension, fusion, Fusion Applications, Oracle, Oracle Press, review

So having written a series of detailed blog entries reviewing a couple of chapters at a time, I thought it might be worth just providing a very brief review. Writing a book that provides both breadth of coverage for a very large subject area as well as meaningful depth is a very difficult trick to pull off. But the authors of this book have succeeded magnificently. The book tackles everything from the basic customization that users can perform through to in-depth feature development using the Oracle SOA stack, not to mention reporting and analytics. The book has been written in an engaging way, providing context, background and Fusion Application principles and then taking examples of how to implement the different kinds of capabilities. From this book, you should have a good grasp of what to expect and how to approach Fusion Applications extension work.

As a result I’d recommend this book to architects and project managers who want to understand what their development team should be doing and the risks of their approach. It would also form a good roadmap into the detail for developers starting out in the Fusion Applications space.

Detailed reviews can be seen at:

Oracle Fusion Applications Development and Extensibility Handbook – Chapters 13, 14 & 15 reviewed

12 Saturday Apr 2014

Posted by mp3monster in Books, General, Oracle, Technology

Tags

ADF, applications, book, fusion, integration, look and feel, OER, Oracle, Oracle Press, review, scheduler, Scheduling

Our final detailed visit to Oracle Fusion Applications Development and Extensibility Handbook (Oracle Press) covers the final 3 chapters which engage with the Scheduler, Look and Feel customisation and the relationship with integration and service concepts (dare I use the acronym SOA).

The chapter on the scheduler is pretty short, but then compared to many other chapters the size of the product/component is small. The book relates how the scheduler behaves compared to the Schedule Management offered in EBusiness. The surprising thing is that each product domain (Financials, HCM, CRM etc) has its own scheduler rather than a single shared service; the book doesn’t attempt to explain the rationale here, which is a shame. It does describe how it deploys into each domain, where the configuration exists and how to work with the configuration of the scheduler itself (e.g. where logging goes etc) and attempts to address some obvious questions from an administration perspective. It then goes on to how to create a custom scheduled process with a worked illustration. All very well done, although I have to admit to a nagging feeling that I’m missing something. It may simply be that deployment is very much through server administration rather than through an automated mechanism (one where, having developed and tested in a preproduction environment, you could package up the deployment of the configuration and custom app to your production environment without needing to repeat the admin UI interactions, and so be assured there is no inconsistency between deployment instances).

The Look and Feel chapter is largely about applying the changes so that the product feels like part of your business’ corporate solution – important if you’re exposing any aspects of it to the outside world. So aside from the use of the tools you have the ADF controls to effectively ‘skin’ the product. The chapter provides a brief but concise view of how skinning works, in relation to the old EBusiness technologies (CLAF and UIX), the current HTML technology of CSS and the key part of ADF (Rich Faces). More importantly, it points out the relevant documentation and sources of information, and tooling such as the skinning editor, not to mention addressing the issue of deployment. Obviously there is a short illustration demonstrating an element of skinning.

ADF Architecture

The initial emphasis of the last chapter is the reality that organisations can’t simply migrate all non Fusion Apps such as EBusiness, Siebel etc to the Fusion solutions in one hit; therefore you need to provide a degree of integration between solutions for as long as the transition may take. This neatly leads into the question of how you know what components exist to support integration, which brings OER (Oracle Enterprise Repository) into the picture. So obviously the book provides a brief overview of the use of OER. The various Fusion apps offer different interfaces for different tasks (from bulk data export to business events), so each of these ‘patterns’ is briefly explained, along with how Fusion apps being offered as a SaaS solution might impact the ‘pattern’ availability. The chapter finishes by walking through the use of an SCA Composite and web services to interact with a Fusion App – probably one of the most common approaches to integration in a transactional (rather than bulk) manner. The only thing missing for me would be a brief discussion on Process Integration Packs (PIPs), which leverage all of the technologies underpinning Fusion Apps into a custom package of integration operations or ready made integrations.

So the final chapters provide a strong close to the book continuing to offer an excellent overview, pointing you to resources to ‘deep dive’ as necessary.

Previous Chapter reviews:

Oracle Fusion Applications Development and Extensibility Handbook – Chapters 11 & 12 reviewed

08 Tuesday Apr 2014

Posted by mp3monster in Books, General, Oracle, Technology

Tags

applications, book, fusion, OBIEE, Oracle, Oracle Press, Oracle Transactional Business Intelligence, review

We continue on in our review of Oracle Fusion Applications Development and Extensibility Handbook (Oracle Press) to chapters 11 and 12 which look at Reporting and Analytics respectively.

Reporting in Fusion Apps is based upon OBIEE rather than vanilla BI Publisher against the application database. This means that you can build your reporting capability against a far more diverse set of data sources (license permitting of course). It does also mean that the steps for creating reports, at least to start with, are more complex, as OBIEE realizes a multi-tier approach to report generation. The chapter goes on to describe the types of data source, the means by which reports can be configured for conditional execution, and then ideas such as ‘bursting’, where the report generating process can be partitioned and run in parallel by multiple processes, each concentrating on a range of data (sounds a little like MapReduce, doesn’t it?). Finally it covers how to format the output. All of this is then supported with a detailed illustration. As you might imagine there are prepackaged reports and templates, so loading and configuring these in an environment is considered.
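The bursting idea as I have described it (split the data into ranges, hand each range to its own worker) can be sketched in a few lines. This is purely my own illustration and has nothing to do with the actual BI Publisher or OBIEE configuration; `split_ranges`, `render_slice` and `burst` are hypothetical names, and the ‘report’ is just a string.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Tuple


def split_ranges(first_id: int, last_id: int, parts: int) -> List[Tuple[int, int]]:
    """Split the inclusive range [first_id, last_id] into up to `parts` contiguous slices."""
    total = last_id - first_id + 1
    size, extra = divmod(total, parts)
    ranges: List[Tuple[int, int]] = []
    start = first_id
    for i in range(parts):
        # The first `extra` slices take one more item so nothing is left over.
        length = size + (1 if i < extra else 0)
        if length == 0:
            break
        ranges.append((start, start + length - 1))
        start += length
    return ranges


def render_slice(id_range: Tuple[int, int]) -> str:
    """Stand-in for generating the report output for one slice of data."""
    low, high = id_range
    return f"report for ids {low}-{high}"


def burst(first_id: int, last_id: int, workers: int) -> List[str]:
    """Render every slice in parallel and return the outputs in slice order."""
    ranges = split_ranges(first_id, last_id, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(render_slice, ranges))
```

Calling `burst(1, 10, 3)` splits ten records into three slices and renders each on its own thread; a real setup would swap the string for the actual report generation.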

The book recognises that in a single chapter you can only really scratch the surface of reporting and makes reference to other tools in the OBIEE kit bag such as OTBI (Oracle Transactional Business Intelligence), BI and Mobile BI composer. The only little trick missed here is the opportunity to point out some good sources of information. But that isn’t significant; there is such a thing as Google, though it might take a bit more reading to find the best resources around these tools.

Chapter 12 looks briefly at the use of Analytics through OBIA (Oracle Business Intelligence Applications) and Oracle Hyperion (also known as Essbase), which is available with Financial Reporting studio, and focuses on OTBI. The chapter feels pretty standalone from the preceding chapter on reporting, which is great when using the book more as a reference, but on a cover to cover read can niggle a little, particularly when both chapters rely on OBIEE background. But to be honest we are nitpicking here. As with previous chapters there is an illustrated scenario walked through (the layout of which isn’t as good as previous chapters, but that is a relative observation). The illustration perhaps misses the opportunity for a killer blow of referencing the core app customisation to show how you might bind the dynamic reporting provided by an OTBI view into the core CRM with the customisation. I have to say I am impressed by the OTBI technologies given the integration into the Fusion security framework, leveraging ADF and its optimisation strategies, all of which are clearly explained here.

It would have been nice to explore OBIA and Oracle Hyperion a bit further, but doing so would probably have warranted additional chapters. Overall a good chapter again, covering a lot of capability efficiently.

OTBI Architecture

Previous Chapter reviews:

Oracle Fusion Applications Development and Extensibility Handbook – Chapters 9 & 10 reviewed

08 Tuesday Apr 2014

Posted by mp3monster in Books, General, Oracle, Technology

Tags

application, book, BPM, BPMN, development, EDN, fusion, Oracle, review

Back to the review of Oracle Fusion Applications Development and Extensibility Handbook (Oracle Press). Chapters 9 & 10 take us from developing ADF based extensions to BPM and developing capabilities using a lot more of the SOA based building blocks such as Human Workflow.

The BPM chapter isn’t huge, as the real effort behind BPM driven processes is actually more SOA based development. But the book does step back to explain Oracle’s history in the BPM and BPMN space and how Fusion Apps work using these technologies. So what we have is a good chapter focusing more on ideas and principles.

Chapter 10 naturally takes us into building full extensions, which could be implementing the activities needed to realise a BPMN process. The chapter is almost two separate halves, the first being the ideas and approaches adopted by Fusion Apps, such as the triggering of processes through EDN and on into the approval framework, and how it compares to the pre-Fusion products. The second half of the chapter turns all of this into practical steps in the various tools to realize functional extensions in a series of comprehensive steps.

Finally the chapter tackles the issues of deploying the customisation and the implications to patching and updating your Fusion Apps.

So yet again the authors have managed to cover a lot of ideas very effectively, providing sufficient insight that you should be able to find the necessary information if you’re working with a Fusion application not discussed here.

Previous Chapter reviews:

Oracle Big Data Handbook – Part 1 Reviewed

07 Monday Apr 2014

Posted by mp3monster in Books, General, Oracle, Technology

Tags

analytics, Big Data, book, Cloudera, NoData, Oracle, Oracle Press, press, RDBMS, UK Oracle User Group, unstructured data

As a result of my involvement with the UK Oracle User Group I have been given the opportunity to review Oracle Press’ Oracle Big Data Handbook. I have to admit that I am not a Big Data expert (and reviewing this book was an opportunity to build my knowledge a bit more).

So, Chapter 1 starts by providing a brief but succinct history of Big Data, from Google’s work with MapReduce (and lesser known technologies such as Sawzall and Dremel) to the rise of Hadoop. The primary value proposition of Big Data is briefly explored (highlighting the point that an RDBMS such as Oracle can actually accommodate lots of data when it is in a structured form), with Big Data being the nexus of volume, speed and variety (multiple structures, semi structured and unstructured). The book does suggest that, in addition to these factors, there is the data’s value (a structured transaction has a lot more value than the same quantity of unstructured data, which delivers its value when in context with other data).

From here, there is a brief look at the Oracle Big Data landscape, which leads nicely into a layout for the chapters of the book, ranging from the Oracle Engineered Systems idea to its adoption of Hadoop through Cloudera, NoSQL and on to how this becomes a joined up solution with the likes of OBIEE, passing through Oracle’s extended version of the R language.

In all a brief, succinct and informative intro.

Chapter 2 takes us on the journey of the business value of Big Data ideas, taking us through some examples such as MCI’s campaign in the 1990s to develop insight by mining for friends and family information. In its day we called this sort of thing data mining; now it’s another aspect of big data. The chapter moves on to describing the idea of an Information Chain Reaction (ICR), where output from one stage produces a response in the next, with communication, change and connection being the primary triggers.

The authors make an interesting point about taking the metrics for volumes of traffic on social sites with a pinch of salt, not because of the possibility of overstatement (although that is a possibility; after all, user numbers are an easy measure for investors) but because of how and when the measurement is done, and even just changes in an API or user process. For example, adopting an approach that drives users to reverify their details regularly could create more user activity while delivering no more real information. Most importantly, what is the value of the information/traffic to you?

I also love the fact that the book uses quotes from famous individuals to emphasise points, for example:

The temptation to form premature theories upon insufficient data is the bane of our profession.

– Sherlock Holmes

Oracle Fusion Applications Development and Extensibility Handbook Chapters 7 & 8

01 Tuesday Apr 2014

Posted by mp3monster in Books, General, Oracle, Technology

Tags

ADF, book, CRM, EBis, extension, fusion, HCM, JDeveloper, Oracle, Oracle Fusion Applications Development, Oracle Press, review

Continuing with the review of Oracle Fusion Applications Development and Extensibility Handbook (Oracle Press), Chapters 7 & 8 get into the development side of building extensions through the use of JDeveloper and the ADF framework, although this approach is not recommended for CRM if it can be helped, but then the Page Composer is far more powerful in the CRM context.

Chapter 7 walks you quickly through the process of establishing JDeveloper so that you can get underway with the customisation. Along the way the book references the very detailed Oracle guides and shares useful tips as well (for example how to share configuration between JDeveloper instances for connecting to a Fusion apps server without having to go through reconfiguration).

As Fusion Apps uses ADF for its framework, knowledge of ADF is going to help you understand more easily what is going on, as the book is not an ADF guide. It focuses upon the use of the framework, providing some honest hints and observations (e.g. it is necessary to know which task flow forms the basis of any page, and identifying this can be easy or difficult depending on the product). The bulk of chapter 7 is focused on guiding you through 2 scenarios for customisation.

By the end of chapter 7, although a lot of information has been shared, I’d have liked to have seen a couple of things addressed. The first is how to minimise the risk/impact of customisation so that deploying a patch doesn’t clash with, or has minimal impact on, any customisation. It is also too easy for organisations to customise a product to the point the C in COTS far outweighs the O and T. Remember CEMLI? The second aspect I’d hoped to have seen is the incorporation of configuration control of the development changes, but this is probably more one of my pet issues showing.

Chapter 8 goes into the mechanics of developing your own UI within a Fusion App, covering DB table creation, business components, UI and so on, including the security framework and the creation of workflow elements. I have to admit that I found this chapter easier than the pure customisation work of chapter 7, although that could be because the whole mechanism is a bit more discrete.

Neither chapter really takes on the question of testing (integration or unit level). I’m sure that, given all the good guidance here, the authors have a few good practices and tricks that they could share on how to make testing as simple as possible.

Aside from a couple of small points, all said and done, the book does a tremendous job of addressing an enormous subject area, and recognises that it isn’t giving you every little detail by telling you which sections of the Fusion Developers guide will provide more detailed information. Bottom line: for whatever the book doesn’t explain, you have enough insight into the official Oracle online docs to go and find the rest of the information (without having to plough through 1000+ pages of developer guide).

See earlier chapter reviews at: